Google Adds AI Music Generation to Gemini App Using DeepMind's Lyria 3 Model
By
Jess Weatherbed
A bagel you'd recommend to a friend without hedging.
Summary
Google has integrated DeepMind's Lyria 3 AI audio model into its Gemini app, enabling users to generate 30-second AI music tracks based on text, images, and video prompts. The feature is now available globally in multiple languages for users 18+, with plans for future expansion. This represents Google's latest move in the competitive AI music generation space.
Key quotes
· 5 pulledGoogle has given Gemini the ability to spit out AI-generated music, courtesy of DeepMind's latest audio model.
Beta access to Lyria 3 is rolling out in the Gemini app, enabling users to generate 30-second tracks based on text, images, and videos, without having to leave the chatbot window.
The new music-making tool is available globally starting today in English, German, Spanish, French, Hindi, Japanese, Korean, and Portuguese, with plans to expand in the future.
Access is limited to Gemini app users who are 18 years or older.
Lyria 3's text-to-music capabilities allow Gemini app users to make s
You might also wanna read
Google DeepMind's Lyria 3 Music Generation Model Now Available in Gemini App Beta
Google DeepMind's Lyria 3 music generation model is now integrated into the Gemini app, allowing users to create 30-second songs from text p
Google DeepMind's Lyria 3 Pro Enables Structured AI Music Creation in Gemini App
Google DeepMind's Lyria 3 Pro in the Gemini app enables users to create AI-generated music tracks up to 3 minutes long with structured music
Google DeepMind's Lyria Camera: Transform Your Camera into a Musical Instrument Using Visual AI
Google DeepMind's Lyria Camera is an innovative product that transforms smartphone cameras into musical instruments. Using Gemini's visual u
Google Launches Gemini 3.1 Flash TTS API with Natural Language Voice Direction and Multi-Speaker Dialogue
Google has launched a text-to-speech API called Gemini 3.1 Flash TTS that features natural language voice direction, inline audio tags, mult
Google Launches Gemini AI with Interactive 3D Visualizations and Simulations
Google has launched Gemini, its largest and most capable AI model that is multimodal and can understand and operate across text, images, aud
Google Unveils Gemini: A Multimodal AI Model to Rival GPT-4
Google's Gemini is introduced as its largest and most capable AI model, designed to be multimodal and capable of understanding and combining
