Microsoft Launches First In-House AI Models MAI-Voice-1 and MAI-1-preview
By
Emma Roth
A respectable bake. You'd come back tomorrow for another.
Summary
Microsoft has launched its first in-house AI models called MAI-Voice-1 and MAI-1-preview. The MAI-Voice-1 speech model can generate a minute of audio in under one second using a single GPU and is already powering features like Copilot Daily's AI news host and podcast-style explanations. The models represent Microsoft's effort to compete with OpenAI despite its significant investments in that company.
Key quotes
· 4 pulledMicrosoft's AI division announced its first homegrown AI models on Thursday: MAI-Voice-1 AI and MAI-1-preview
The company says its new MAI-Voice-1 speech model can generate a minute's worth of audio in under one second on just one GPU
MAI-1-preview 'offers a glimpse of future offerings inside Copilot'
Microsoft already uses MA1-Voice-1 to power a couple of its features, including Copilot Daily, which has an AI host recite the day's top news stories
You might also wanna read
Microsoft Launches MAI-Voice-1 Speech Generation Model with Sub-Second Audio Processing
Microsoft has launched MAI-Voice-1, a highly efficient speech generation model that can generate a full minute of audio in under a second on
Microsoft Launches MAI-Transcribe-1: Multilingual Speech-to-Text Model for Production Use
Microsoft has launched MAI-Transcribe-1, a new multilingual speech-to-text model designed for production use. The model offers best-in-class
OpenAI Launches GPT-Realtime Model and Voice API for Advanced Voice Agent Development
OpenAI has released its gpt-realtime model and Realtime API, which represent a significant advancement in voice AI technology. The key innov
OpenAI Releases Realtime API with Production Voice Agent Features and Advanced GPT-Realtime Model
OpenAI has made its Realtime API generally available with new production-ready features for voice agents, including support for remote MCP s
OpenAI Launches GPT-Realtime Model for Advanced Voice Agent Capabilities
OpenAI has released its gpt-realtime model, which represents a significant advancement in voice agent technology. The key innovation is that
Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription
Microsoft has open-sourced VibeVoice, a frontier voice AI system that includes VibeVoice-ASR, a unified speech-to-text model capable of hand
