Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription

Open-Source Frontier Voice AI. Contribute to microsoft/VibeVoice development by creating an account on GitHub.

tosh2mo ago6 min readenCode

You might also wanna read

OpenAI's Whisper, released in 2022, became a widely adopted speech recognition model by converting audio into log-Mel spectrograms — image-l

AI voice dictation that runs entirely on your Mac. No cloud, no subscription, no data leaving your device. On-device Whisper and NVIDIA Para

Vibe is an AI language learning app where beginners can speak from day one. Match with AI friends based on your interests and call them to p

Meet the new Microsoft Copilot, your AI companion that remembers details (Memory), takes action (Actions), sees your world (Vision), and mor

A new guide outlines a robust architectural blueprint for an inbound clinic appointment assistant, "Claudia," designed to overcome common la

A developer building the open-source audio analysis library 'audiotrace' needed a way to label speakers in call transcripts without requirin

No comments yet. Be the first.