All Topics

Technology

Art

Microsoft Launches First In-House AI Models MAI-Voice-1 and MAI-1-preview

Emma Roth

9mo ago· 2 min readenNews

75/100

Toasty

Bagelometer↗

A respectable bake. You'd come back tomorrow for another.

Score75TypenewsSentimentpositive

Summary

Microsoft has launched its first in-house AI models called MAI-Voice-1 and MAI-1-preview. The MAI-Voice-1 speech model can generate a minute of audio in under one second using a single GPU and is already powering features like Copilot Daily's AI news host and podcast-style explanations. The models represent Microsoft's effort to compete with OpenAI despite its significant investments in that company.

Key quotes

· 4 pulled

Microsoft's AI division announced its first homegrown AI models on Thursday: MAI-Voice-1 AI and MAI-1-preview

The company says its new MAI-Voice-1 speech model can generate a minute's worth of audio in under one second on just one GPU

MAI-1-preview 'offers a glimpse of future offerings inside Copilot'

Microsoft already uses MA1-Voice-1 to power a couple of its features, including Copilot Daily, which has an AI host recite the day's top news stories

Snippet from the RSS feed

Microsoft has launched two new AI models that put it in the running to compete with OpenAI, which it has poured billions into.

You might also wanna read

Microsoft Launches MAI-Voice-1 Speech Generation Model with Sub-Second Audio Processing

Microsoft has launched MAI-Voice-1, a highly efficient speech generation model that can generate a full minute of audio in under a second on

Product Hunt·9mo ago

Microsoft Launches MAI-Transcribe-1: Multilingual Speech-to-Text Model for Production Use

Microsoft has launched MAI-Transcribe-1, a new multilingual speech-to-text model designed for production use. The model offers best-in-class

Product Hunt·1mo ago

OpenAI Launches GPT-Realtime Model and Voice API for Advanced Voice Agent Development

OpenAI has released its gpt-realtime model and Realtime API, which represent a significant advancement in voice AI technology. The key innov

Product Hunt·9mo ago

OpenAI Releases Realtime API with Production Voice Agent Features and Advanced GPT-Realtime Model

OpenAI has made its Realtime API generally available with new production-ready features for voice agents, including support for remote MCP s

openai.com·9mo ago

OpenAI Launches GPT-Realtime Model for Advanced Voice Agent Capabilities

OpenAI has released its gpt-realtime model, which represents a significant advancement in voice agent technology. The key innovation is that

Product Hunt·9mo ago

Microsoft Open-Sources VibeVoice: A Speech-to-Text AI for Long-Form Audio Transcription

Microsoft has open-sourced VibeVoice, a frontier voice AI system that includes VibeVoice-ASR, a unified speech-to-text model capable of hand

github.com·1mo ago