The Keyword
Gemini Embedding 2: Our first natively multimodal embedding model

Gemini Embedding 2 benchmarks
"Empowering our teams to seamlessly search past and present content has increasingly driven us to vector search. While initially seeing great results with traditional large text embeddings (3,072 dim), crowding in vector space quickly took over; the right results couldn't reliably surface their way up from the noise. Gemini's new Embedding 2 model completely changed the game. Text queries can now pinpoint untranscribed micro-expressions, and we can even leverage existing media, such as a photo or B-roll clip, as the search input to instantly retrieve matching video assets. This propelled our text-to-video Recall@1 rate to 85.3%." Seth Georgian, VP Technology Innovation, Paramount Skydance
"We chose Gemini embeddings to help legal professionals find critical information during the discovery process in litigation -- a highly technical challenge in a high-stakes setting, and one Gemini excels at. In our most recent tests, Gemini's multi-modal embedding model improves precision and recall across millions of records, while unlocking powerful new search functionality for images and videos. For legal professionals, these new capabilities open up entirely novel ways to quickly understand case materials in even the largest matters." Max Christoff, CTO at Everlaw
"Gemini Embedding 2 is the foundation for Sparkonomy’s Creator Economic Equality Engine. Its native multi-modality slashes our latency by up to 70% by removing LLM inference and nearly doubles semantic similarity scores for text-image and text-video pairs—leaping from 0.4 to 0.8. This powers our proprietary Creator Genome to index millions of minutes of video, alongside images and text, with unprecedented precision—unlocking unbiased brand collaborations and democratizing economic success for every creator." Guneet Singh, Co-founder at Sparkonomy
"The API continuity is excellent. Gemini Embedding 2 drops right into our existing workflow with minimal changes. We’re testing new ways to embed text-based conversational memories together with audio and visual embeddings, especially assistant question-and-answer pairs, and seeing a 20% lift in top-1 recall for our personal wellness app." Ertuğrul Çavuşoğlu, Co-founder at Mindlid
