Technology

Art

Boson AI Releases Higgs Audio v3 TTS: Expressive Multilingual Speech Model with Voice Cloning

14d ago· 9 min readen

technology science open source ai/ml

Summary

Boson AI has released Higgs Audio v3 TTS, a text-to-speech model designed for voice chat applications. It converts model responses into expressive conversational speech across 100+ languages, featuring zero-shot voice cloning and inline control over emotion, style, prosody, pauses, and sound effects. The model is released for research and non-commercial use under a specific license, with commercial use requiring a separate license. The release is part of Boson AI's mission to advance and democratize AI through open source and open science.

Source

bskyBoson AI Releases Higgs Audio v3 TTS: Expressive Multilingual Speech Model with Voice Cloninghuggingface.co

Key quotes

· 3 pulled

Higgs Audio v3 TTS is built for voice chat: it speaks, not just reads.

It turns model responses into expressive conversational speech across 100+ languages, with zero-shot voice cloning and inline control over emotion, style, prosody, pauses, and sound effects.

We're on a journey to advance and democratize artificial intelligence through open source and open science.

Snippet from the RSS feed

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

You might also wanna read

Higgsfield Launches Speak 2.0 AI Platform for Realistic Avatar Video Creation

Higgsfield has launched Speak 2.0, an AI-powered platform that enables creators to generate motion-driven talking videos with realistic emot

Product Hunt·9mo ago

Kitten TTS: A Lightweight 25MB AI Voice Model for CPU-Based Speech Synthesis

The article introduces Kitten TTS, a groundbreaking 25MB AI voice model that operates efficiently on CPUs without requiring GPUs or expensiv

algogist.com·10mo ago

ElevenLabs AI Text-to-Speech Platform Creates Natural Voices in Any Language

ElevenLabs offers AI-powered text-to-speech and voice cloning software that creates natural-sounding voices in any language. The platform pr

Product Hunt·1y ago

ElevenLabs: AI-Powered Text-to-Speech and Voice Cloning Software

ElevenLabs offers advanced AI-powered text-to-speech and voice cloning software, providing lifelike and natural voices for creators and publ

Product Hunt·10mo ago

ElevenLabs: AI-Powered Text-to-Speech and Voice Cloning Software

ElevenLabs offers advanced AI-powered text-to-speech and voice cloning software, providing lifelike and natural voices for creators and publ

Product Hunt·10mo ago

Real-Time Voice Cloning Implementation Using SV2TTS Deep Learning Framework

This repository implements a real-time voice cloning system called SV2TTS (Transfer Learning from Speaker Verification to Multispeaker Text-

github.com·9mo ago

Comments

No comments yet. Be the first.