Google unveils Gemini Embedding 2, its first multimodal embedding model

Google has officially unveiled Gemini Embedding 2, its first multimodal embedding model. While earlier models were limited to text, Gemini Embedding 2 maps text, images, videos, audio and documents into a single embedding space. Google says the model simplifies complex pipelines and improves a wide range of multimodal downstream tasks, from retrieval-augmented generation (RAG) and semantic search to sentiment analysis and data clustering. Here’s a detailed look at how the new embedding model works and how you can use it.

Digit.in

Gemini Embedding 2: How does it work?

Google explained that the new model is built on Gemini. According to the company, it leverages Gemini’s best-in-class multimodal understanding capabilities to create high-quality embeddings across various media.

For text, the model supports a context of up to 8192 input tokens. For images, it can process up to 6 images per request in the popular PNG and JPEG formats.

Videos are where things get interesting, as the model supports up to 120 seconds of video input in MP4 and MOV formats. It can also natively ingest and embed audio without needing intermediate text transcription, and it can directly embed PDFs of up to 6 pages.
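To make these limits concrete, here is a minimal sketch of a pre-flight check you could run before sending a request. It is not part of Google's API: the `EmbedRequest` class, the `check_limits` function and the constant names are hypothetical, and the numbers are simply the limits quoted above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Limits as quoted in this article; all names here are hypothetical.
MAX_TEXT_TOKENS = 8192
MAX_IMAGES = 6
MAX_VIDEO_SECONDS = 120
MAX_PDF_PAGES = 6
IMAGE_FORMATS = {"png", "jpeg"}
VIDEO_FORMATS = {"mp4", "mov"}


@dataclass
class EmbedRequest:
    """Hypothetical summary of what one embedding request contains."""
    text_tokens: int = 0
    image_formats: List[str] = field(default_factory=list)  # one entry per image
    video_seconds: float = 0.0
    video_format: Optional[str] = None
    pdf_pages: int = 0


def check_limits(req: EmbedRequest) -> List[str]:
    """Return human-readable limit violations; an empty list means none found."""
    problems = []
    if req.text_tokens > MAX_TEXT_TOKENS:
        problems.append(f"text has {req.text_tokens} tokens (max {MAX_TEXT_TOKENS})")
    if len(req.image_formats) > MAX_IMAGES:
        problems.append(f"{len(req.image_formats)} images (max {MAX_IMAGES})")
    bad_images = sorted({f.lower() for f in req.image_formats} - IMAGE_FORMATS)
    if bad_images:
        problems.append(f"unsupported image formats: {', '.join(bad_images)}")
    if req.video_seconds > MAX_VIDEO_SECONDS:
        problems.append(f"video is {req.video_seconds}s (max {MAX_VIDEO_SECONDS}s)")
    if req.video_format is not None and req.video_format.lower() not in VIDEO_FORMATS:
        problems.append(f"unsupported video format: {req.video_format}")
    if req.pdf_pages > MAX_PDF_PAGES:
        problems.append(f"PDF has {req.pdf_pages} pages (max {MAX_PDF_PAGES})")
    return problems


# Example: seven PNG images exceed the six-image limit; everything else is fine.
request = EmbedRequest(text_tokens=500, image_formats=["png"] * 7,
                       video_seconds=90, video_format="mp4")
print(check_limits(request))
```

A check like this only catches the limits mentioned in this article; the service itself may enforce others.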

The best part is that the model has been built to process more than one medium at a time: you can pass multiple media, such as image + text, in a single request. According to Google, this lets the model capture relationships between different media types, unlocking a better understanding of real-world data.
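The practical payoff of a single embedding space is that a text query can be compared directly with images, audio or documents. The sketch below illustrates the idea with cosine similarity. The three-dimensional vectors and item labels are made up for illustration and stand in for real embeddings, which have far more dimensions; the function names are hypothetical too.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank(query, items):
    """Return item names ordered from most to least similar to the query."""
    return sorted(items, key=lambda name: cosine_similarity(query, items[name]),
                  reverse=True)


# Toy vectors standing in for embeddings of items of different media types.
corpus = {
    "image: cat photo": [0.9, 0.1, 0.0],
    "audio: dog barking": [0.1, 0.9, 0.1],
    "pdf: tax form": [0.0, 0.1, 0.9],
}

# Toy vector standing in for the embedding of a text query about cats.
query_vector = [0.8, 0.2, 0.1]

ranking = rank(query_vector, corpus)
print(ranking)
```

With these toy values the cat photo ranks first, because its vector points in nearly the same direction as the query's. The same ranking step is what a semantic search or RAG retriever would run over real embeddings.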

Improvements over previous models

Google also shared how the model performs against the other multimodal models in the space. According to the company, Gemini Embedding 2 not only improves on its legacy models but also sets a new performance standard compared with the other models.

The company shared this table, detailing the performance improvements compared with the other models:

Gemini Embedding 2

Using Google’s new Gemini Embedding 2 multimodal embedding model is pretty simple, too. You can access it through either the Gemini API or Vertex AI. In its official blog post, Google has released the code required to access the model.
