FeedBagel

Art

openai.fm

11mo ago · Code

Source

OpenAI · openai.fm · github.com

Snippet from the RSS feed

Reference implementation for speech-related applications. — audio

You might also wanna read

OpenAI launches three new audio models for real-time voice applications in the API

OpenAI is introducing three new audio models in its API that enable developers to build more natural, intelligent, and real-time voice appli

openai.com·25d ago

Platform for creating accessible and multilingual itineraries with voice assistant

ema.europa.eu·4mo ago

Neural Audio Codecs: Bridging the Gap Between Language Models and Audio Processing

This article explores the technical challenge of integrating audio directly into large language models (LLMs) using neural audio codecs. It

kyutai.org·8mo ago

How OpenAI rebuilt its WebRTC stack for low-latency voice AI at scale

OpenAI rearchitected its WebRTC stack to address three key constraints for real-time voice AI: low-latency audio delivery, global scale, and

OpenAI·2mo ago

Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription

Microsoft has open-sourced VibeVoice, a frontier voice AI system that includes VibeVoice-ASR, a unified speech-to-text model capable of hand

GitHub·2mo ago

VoiceAI: A Developer's Learning Path for Building Real-Time Voice Agents

A curated, developer-friendly learning path for building real-time voice AI agents, covering the full stack from speech-to-text foundations

GitHub·2mo ago
