openai.fm
11mo agoCode
Source
Reference implementation for speech-related applications. — audio
You might also wanna read
OpenAI launches three new audio models for real-time voice applications in the API
OpenAI is introducing three new audio models in its API that enable developers to build more natural, intelligent, and real-time voice appli
Platform for creating accessible and multilingual itineraries with voice assistant
ema.europa.eu·4mo ago
Neural Audio Codecs: Bridging the Gap Between Language Models and Audio Processing
This article explores the technical challenge of integrating audio directly into large language models (LLMs) using neural audio codecs. It
How OpenAI rebuilt its WebRTC stack for low-latency voice AI at scale
OpenAI rearchitected its WebRTC stack to address three key constraints for real-time voice AI: low-latency audio delivery, global scale, and
Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription
Microsoft has open-sourced VibeVoice, a frontier voice AI system that includes VibeVoice-ASR, a unified speech-to-text model capable of hand
VoiceAI: A Developer's Learning Path for Building Real-Time Voice Agents
A curated, developer-friendly learning path for building real-time voice AI agents, covering the full stack from speech-to-text foundations

Comments
Sign in to join the conversation.
No comments yet. Be the first.