Google unveils Gemini Embedding 2, its first multimodal embedding model

Google has unveiled Gemini Embedding 2, its first multimodal embedding model. AI models started out limited to text; with Gemini Embedding 2, Google aims to map text, images, video, audio and documents into a single embedding space. Google says the model is meant to simplify complex pipelines and improve a wide range of multimodal downstream tasks, including retrieval-augmented generation (RAG), semantic search, sentiment analysis and data clustering. Here’s a detailed look at how the new embedding model works and how you can use it.

Digit.in

Gemini Embedding 2: How does it work?

Google says the new model is built on Gemini and uses its multimodal understanding capabilities, which the company describes as best-in-class, to create high-quality embeddings across various media.

For text, the model supports a context of up to 8192 input tokens. For images, it can process up to 6 images per request in PNG and JPEG formats.

Video is where things get interesting: the model supports up to 120 seconds of video input in MP4 and MOV formats. It can also natively ingest and embed audio without an intermediate text transcription, and directly embed PDFs of up to 6 pages.
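The limits listed above can be captured in a small pre-flight check. This is only a minimal sketch: the `EmbedRequest` class and `check_limits` function are hypothetical names made up for illustration and are not part of any Google SDK, and it assumes every limit applies per request (the article states this explicitly only for images).

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Limits as reported in the article. The constant names are illustrative only.
MAX_TEXT_TOKENS = 8192
MAX_IMAGES = 6
MAX_VIDEO_SECONDS = 120
MAX_PDF_PAGES = 6
IMAGE_FORMATS = {"png", "jpeg"}
VIDEO_FORMATS = {"mp4", "mov"}


@dataclass
class EmbedRequest:
    """Hypothetical description of the inputs in one embedding request."""
    text_tokens: int = 0
    image_formats: List[str] = field(default_factory=list)  # one entry per image
    video_seconds: float = 0.0
    video_format: Optional[str] = None
    pdf_pages: int = 0


def normalize(fmt: str) -> str:
    """Lower-case a format name and treat 'jpg' as 'jpeg'."""
    fmt = fmt.lower().lstrip(".")
    return "jpeg" if fmt == "jpg" else fmt


def check_limits(req: EmbedRequest) -> List[str]:
    """Return a list of problems; an empty list means the request fits."""
    problems = []
    if req.text_tokens > MAX_TEXT_TOKENS:
        problems.append(f"text has {req.text_tokens} tokens (max {MAX_TEXT_TOKENS})")
    if len(req.image_formats) > MAX_IMAGES:
        problems.append(f"{len(req.image_formats)} images (max {MAX_IMAGES})")
    unsupported = sorted({normalize(f) for f in req.image_formats} - IMAGE_FORMATS)
    if unsupported:
        problems.append(f"unsupported image formats: {', '.join(unsupported)}")
    if req.video_seconds > MAX_VIDEO_SECONDS:
        problems.append(f"video is {req.video_seconds}s (max {MAX_VIDEO_SECONDS}s)")
    if req.video_format is not None and normalize(req.video_format) not in VIDEO_FORMATS:
        problems.append(f"unsupported video format: {normalize(req.video_format)}")
    if req.pdf_pages > MAX_PDF_PAGES:
        problems.append(f"PDF has {req.pdf_pages} pages (max {MAX_PDF_PAGES})")
    return problems


# Example: seven images, one of them a GIF, fails two of the checks.
for problem in check_limits(EmbedRequest(image_formats=["png"] * 6 + ["gif"])):
    print(problem)
```

A check like this would run before any API call, so oversized inputs can be split or trimmed locally rather than rejected by the service.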

The model can also process more than one medium at a time: multiple media types, such as image + text, can be passed in a single request. According to Google, this lets the model work across different media types, unlocking a better understanding of real-world data.
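To see why a single embedding space matters, consider semantic search across media types: once every item is a vector in the same space, a text query can be ranked against images, audio and PDFs with one similarity measure. The sketch below uses cosine similarity, a common choice for comparing embeddings; the article does not say which measure Google recommends. The 3-dimensional vectors and file names are made-up stand-ins, not real model output.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction to compare
    return dot / (norm_a * norm_b)


def search(query_vector, index, top_k=1):
    """Return the names of the top_k items most similar to the query."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query_vector, item[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:top_k]]


# Toy vectors standing in for embeddings of three different media types.
index = {
    "photo_of_a_cat.png": [0.9, 0.1, 0.0],
    "engine_noise.mp3": [0.0, 0.2, 0.9],
    "invoice.pdf": [0.1, 0.9, 0.1],
}

# Pretend this is the embedding of the text query "a small cat".
query = [0.8, 0.2, 0.1]
print(search(query, index, top_k=2))
```

In a real pipeline the vectors would come from the embedding model and would have far more dimensions, but the ranking step would look much the same.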

Improvements over previous models

Google also shared how the model performs against other multimodal models. According to the company, Gemini Embedding 2 not only improves on its legacy models but also sets a new performance standard relative to competing models.

Google shared the table below, detailing the performance improvements compared to other models:

Gemini Embedding 2

Using Gemini Embedding 2 is fairly simple, too. You can head over to either the Gemini API or Vertex AI and try it from there. Google has published the code needed to access the model in its official blog post.
