All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Inworld launches TTS-2 with cross-lingual synthesis and natural language voice control

By

Aleksey Tikhonov

1mo ago· 3 min readenProduct

Summary

Inworld announces TTS-2, the successor to their #1 ranked text-to-speech model (TTS 1.5), featuring six major upgrades including natural language voice direction, text-based voice design, cross-lingual synthesis across 100+ languages, IPA phonetic control, and improved pronunciation. The company offers a unified API platform combining speech-to-text, LLM routing, and top-ranked TTS for developers building voice agents, AI companions, and conversational applications.

Key quotes

· 5 pulled
Realtime TTS 1.5 is #1 on Artificial Analysis, voted best in blind tests by thousands of real users.
TTS-2 builds on that with six major upgrades: natural language voice direction for tone, emotion, speed, and pitch.
Cross-lingual synthesis across 100+ languages preserving speaker identity.
One platform with speech-to-text, an LLM router, and the top-ranked text-to-speech, all connected on a single API so context flows between every layer.
Used by developers building voice agents, AI companions, and conversational apps.
Snippet from the RSS feed
Inworld builds the infrastructure for production voice AI. One platform with speech-to-text, an LLM router, and the top-ranked text-to-speech, all connected on a single API so context flows between every layer. Used by developers building voice agents, AI

You might also wanna read