All Topics

Technology

Art

Microsoft Launches MAI-Voice-1 Speech Generation Model with Sub-Second Audio Processing

Chris Messina

9mo ago· 5 min readenProduct

100/100

Golden Brown

Bagelometer↗

If you only eat one bagel today, this is the bagel.

Score100TypenewsSentimentpositive

Summary

Microsoft has launched MAI-Voice-1, a highly efficient speech generation model that can generate a full minute of audio in under a second on a single GPU. This represents the second launch from Microsoft Copilot, which aims to incorporate web context, work data, and real-time PC activity to provide better AI assistance. The technology is positioned as a competitor to ElevenLabs and will be available across Windows 11, Microsoft 365, Edge browser, and Bing.

Key quotes

· 4 pulled

MAI-Voice-1 is a lightning-fast speech generation model, with an ability to generate a full minute of audio in under a second on a single GPU

Microsoft coming after @ElevenLabs — /me grabs popcorn

Wow, that speed is absolutely mind-blowing. Beyond efficiency

Copilot will uniquely incorporate the context and intelligence of the web, your work data and what you are doing in the moment on your PC to provide better assistance

Snippet from the RSS feed

Copilot will uniquely incorporate the context and intelligence of the web, your work data and what you are doing in the moment on your PC to provide better assistance. Available in Windows 11, Microsoft 365, and in our web browser with Edge and Bing.

You might also wanna read

Microsoft Launches First In-House AI Models MAI-Voice-1 and MAI-1-preview

Microsoft has launched its first in-house AI models called MAI-Voice-1 and MAI-1-preview. The MAI-Voice-1 speech model can generate a minute

The Verge·9mo ago

Microsoft Launches First In-House AI Image Generator MAI-Image-1

Microsoft has launched its first in-house AI image generator, MAI-Image-1, which is now available in Bing Image Creator and Copilot Audio Ex

The Verge·6mo ago

Microsoft AI Launches First In-House Text-to-Image Generator MAI-Image-1

Microsoft AI has announced MAI-Image-1, its first in-house developed text-to-image generator. The company describes this as "the next step o

The Verge·7mo ago

Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription

Microsoft has open-sourced VibeVoice, a frontier voice AI system that includes VibeVoice-ASR, a unified speech-to-text model capable of hand

GitHub·1mo ago

Microsoft Expands AI Integration in Windows 11 with Copilot Voice and Vision Features

Microsoft is advancing its AI integration strategy for Windows 11, positioning every PC as an AI-powered device controlled by Copilot. The c

The Verge·7mo ago

Microsoft Launches Maia 200 AI Accelerator Chip to Compete with Amazon and Google

Microsoft announces the Maia 200, its latest in-house AI accelerator chip built on TSMC's 3nm process. The chip features over 100 billion tr

The Verge·4mo ago