Google Releases Gemini Embedding 2: First Natively Multimodal Embedding Model
By Rohan Chaubey
Summary
Google has released Gemini Embedding 2, its first natively multimodal embedding model that can map text, images, video, audio, and documents into a single embedding space, enabling cross-media retrieval and classification. The model is currently available in public preview.
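To make "a single embedding space" concrete, here is a minimal sketch of cross-media retrieval by cosine similarity. It does not call Gemini Embedding 2 or any Google API. The vectors, file names, and the `search` helper are hypothetical stand-ins, chosen by hand to show how a text query could be ranked against image, audio, and video items once they all share one vector space.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical, hand-written 3-dimensional vectors. A real embedding model
# would produce much longer vectors; these only illustrate the idea that
# items of different media types live in the same space.
index = {
    ("image", "cat_photo.jpg"): [0.9, 0.1, 0.0],
    ("audio", "dog_bark.wav"): [0.1, 0.9, 0.1],
    ("video", "cooking_demo.mp4"): [0.0, 0.2, 0.9],
}


def search(query_vector, index, top_k=1):
    """Rank every indexed item by similarity to the query, best first."""
    scored = [
        (cosine_similarity(query_vector, vector), key)
        for key, vector in index.items()
    ]
    scored.sort(reverse=True)
    return scored[:top_k]


# Stand-in for the embedding of a text query such as "a photo of a cat".
text_query = [0.8, 0.2, 0.1]

best = search(text_query, index)
print(best[0][1])
```

Because the query and the indexed items are compared with the same similarity measure, the search step does not need to know which media type each vector came from. That is the property that makes cross-media retrieval possible.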
Key quotes
Gemini Embedding 2 is Google's first natively multimodal embedding model
maps text, images, video, audio and documents into a single embedding space
enabling multimodal retrieval and classification across different types of media
available now in public preview