Google unveils Gemini Embedding 2, its first multimodal embedding model

Google has officially unveiled its first multimodal embedding model, Gemini Embedding 2. While AI models started out limited to text, Gemini Embedding 2 is designed to map text, images, videos, audio and documents into a single embedding space. With the model, Google wants to simplify complex pipelines and enhance a wide variety of multimodal downstream tasks, including retrieval-augmented generation (RAG), semantic search, sentiment analysis and data clustering. Here’s a detailed look at how this new embedding model works and how you can use it.

Digit.in


Gemini Embedding 2: How does it work?

Google explains that the new model is based on Gemini. According to the company, it leverages Gemini's multimodal understanding capabilities, which Google describes as best-in-class, to create high-quality embeddings across various media.

For text, the model supports a context of up to 8,192 input tokens. For images, it can process up to 6 images per request, in either PNG or JPEG format.

Videos are where things get interesting, as the model supports up to 120 seconds of video input in both MP4 and MOV formats. It can also ingest and embed audio natively, without needing intermediate text transcription, and can directly embed PDFs of up to 6 pages.
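To make those limits concrete, here is a minimal sketch of a client-side check you could run before sending a request. The `EmbedRequest` class and `check_limits` function are hypothetical names invented for this illustration, not part of any Google SDK; only the numeric limits and formats come from the figures reported above.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Limits as reported in this article; all names here are illustrative only.
MAX_TEXT_TOKENS = 8192
MAX_IMAGES = 6
MAX_VIDEO_SECONDS = 120
MAX_PDF_PAGES = 6
IMAGE_FORMATS = {"png", "jpeg"}
VIDEO_FORMATS = {"mp4", "mov"}


@dataclass
class EmbedRequest:
    """Hypothetical summary of what one embedding request contains."""
    text_tokens: int = 0
    image_formats: Tuple[str, ...] = ()
    video_seconds: float = 0.0
    video_format: Optional[str] = None
    pdf_pages: int = 0


def check_limits(req: EmbedRequest) -> List[str]:
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    if req.text_tokens > MAX_TEXT_TOKENS:
        problems.append(f"text exceeds {MAX_TEXT_TOKENS} tokens")
    if len(req.image_formats) > MAX_IMAGES:
        problems.append(f"more than {MAX_IMAGES} images")
    bad_formats = sorted({f.lower() for f in req.image_formats} - IMAGE_FORMATS)
    if bad_formats:
        problems.append("unsupported image format(s): " + ", ".join(bad_formats))
    if req.video_seconds > MAX_VIDEO_SECONDS:
        problems.append(f"video longer than {MAX_VIDEO_SECONDS} seconds")
    if req.video_format is not None and req.video_format.lower() not in VIDEO_FORMATS:
        problems.append(f"unsupported video format: {req.video_format}")
    if req.pdf_pages > MAX_PDF_PAGES:
        problems.append(f"PDF longer than {MAX_PDF_PAGES} pages")
    return problems


request = EmbedRequest(text_tokens=500, image_formats=("png", "jpeg"), pdf_pages=4)
print(check_limits(request))
```

A check like this only mirrors the published numbers; the service itself remains the authority on what it accepts.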

The best part is that the model has been built to process more than one medium at a time: you can pass multiple media, such as image + text, in a single request. According to Google, this lets the model capture relationships between different media types, unlocking a better understanding of real-world data.
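The practical upshot of a single embedding space is that a text query can be compared directly with images, audio or PDFs. The sketch below illustrates the idea with cosine similarity over toy three-dimensional vectors; the vectors and labels are made up for the example, and real embeddings from the model would have far more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query_vec, items):
    """Sort (label, vector) pairs by similarity to the query, best first."""
    scored = [(label, cosine_similarity(query_vec, vec)) for label, vec in items]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for embeddings of items in different media.
index = [
    ("photo: dog on beach", [0.9, 0.1, 0.0]),
    ("audio: jazz clip", [0.0, 0.2, 0.9]),
    ("pdf: tax form", [0.1, 0.9, 0.1]),
]

# Pretend this is the embedding of the text "a dog playing by the sea".
query = [0.8, 0.2, 0.1]

results = rank(query, index)
for label, score in results:
    print(f"{score:.3f}  {label}")
```

Because every item lives in the same space, the ranking code does not need to know which medium each vector came from.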

Improvements over previous models

Google also shared how the model performs against other multimodal models available in the space. According to the company, Gemini Embedding 2 not only improves on its legacy models but also sets a new performance standard compared to other models.

Google also shared a table detailing the performance improvements compared to the other models.

Gemini Embedding 2

Using Google’s new Gemini Embedding 2 multimodal embedding model is pretty simple, too. You can access it through either the Gemini API or Vertex AI. In its official blog post, Google has released the code required to access the model.
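Google's post is the place to get the official snippet. As a rough idea of the shape of such a call, here is a sketch that assumes a client from the `google-genai` Python SDK, whose `client.models.embed_content(model=..., contents=...)` method returns a response with an `embeddings` list. The helper name `embed_texts` is invented for this example, and the model identifier `gemini-embedding-2-preview` is an assumption that should be checked against Google's documentation.

```python
def embed_texts(client, texts, model="gemini-embedding-2-preview"):
    """Embed a list of strings and return one list of floats per string.

    `client` is expected to behave like a google-genai client.
    The default model name is an assumption, not confirmed by this article.
    """
    response = client.models.embed_content(model=model, contents=texts)
    return [list(embedding.values) for embedding in response.embeddings]


# Example usage (needs the google-genai package and an API key in the environment):
#
#   from google import genai
#   client = genai.Client()
#   vectors = embed_texts(client, ["How does a multimodal embedding work?"])
#   print(len(vectors[0]))
```

Passing the client in as an argument keeps the helper easy to try out with a stand-in object before wiring it to the real service.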
