How Google’s latest AI breakthrough may end global ‘memory shortage’ problem

Google has unveiled a new technique that could dramatically reduce the amount of memory required to run artificial intelligence (AI) models. The breakthrough, called TurboQuant, was announced by Alphabet’s Google this week, and may be a solution for ‘memory shortage’ problem globally. According to the company, the new technique can shrink the memory needed to run large language models that power chatbots like Google’s own Gemini by up to six times, while also making them up to eight times faster. Interestingly, Google claims all of this comes with zero loss in accuracy.

“Introducing TurboQuant: Our new compression algorithm that reduces LLM key-value cache memory by at least 6x and delivers up to 8x speedup, all with zero accuracy loss, redefining AI efficiency,” Google Research said in a post on X (formerly Twitter).

What is TurboQuant and how it solves ‘memory problem’

When users have a long conversation with an AI chatbot, the AI needs to remember everything that came before. It stores all of that context in what is known as a key-value cache which grows larger the longer the conversation goes on. This leads to use of enormous amount of RAM, and it fills up fast.

On everyday devices like phones and laptops, this creates real limitations: the AI can only handle short conversations before it slows down or runs out of memory entirely.

As per Google, TurboQuant is like a compression tool that shrinks the data to one-sixth of its original size.

“Techniques like TurboQuant are critical for this mission. They allow for building and querying large vector indices with minimal memory, near-zero preprocessing time, and state-of-the-art accuracy. This makes semantic search at Google’s scale faster and more efficient. As AI becomes more integrated into all products, from LLMs to semantic search, this work in fundamental vector quantization will be more critical than ever,” Google said.

If TurboQuant delivers on its promise, the practical implications are significant, specifically for data centres that require huge number of High-Bandwidth Memory (HBM) – that have negatively impacted the overall supply of consumer RAMs. Moreover, phones, laptops and cheaper computers could run more powerful AI tools without needing high-end hardware. Responses would feel faster, and the cost of running AI applications would fall.

Latest

AI is giving bad advice to flatter its users, says new study on dangers of overly agreeable chatbots

AI is giving bad advice to flatter its users, says new study on dangers of overly agreeable chatbots

Wikipedia bans use of AI for text except for two use cases: Read the policy update

Tech News News: Wikipedia has officially banned the use of artificial intelligence (AI) for generating or rewriting article content. The new policy, now in effe

WhatsApp launches multiple accounts on iPhone, iOS to Android chat transfer and 4 more features

Tech News News: WhatsApp has announced a wave of new features rolling out to users. These features address some of the most common frustrations people have with

BGMI gets new cricket league minigame: What it is and how it works

Gaming News: Krafton India has introduced a new cricket-themed minigame in Battlegrounds Mobile India (BGMI). The timing of this new mini-game aligns with the T

Meta is rebranding employee titles and roles as the company tells employees in memo: Goal is to drive a step change in engineering …

Tech News News: Facebook-parent Meta is reportedly restructuring roles within its Reality Labs division. The social media giant is said to be rebranding employe

Topics

Silicon Valley’s Ethan Agarwal urges Trump to lift ban on Iranian students, gets called out as ‘moron’

US News: Silicon Valley entrepreneur Ethan Agarwal, who is running for Congress in California-17 as a challenger to Ro Khanna in Dem primary, urged President D.

Transgender women barred from female events at Olympics from 2028

Under the new framework, athletes will be required to undergo a one-time genetic screening to determine eligibility.

Iran in driver seat, Trump’s only option is to throw all he has: Top US official

Douglas Macgregor says Iran has the upper hand, leaving Donald Trump relying on massive conventional force to avoid global and domestic humiliation, though succ

Inside Iran’s rising nuclear debate: Hardliners push bomb option as war rages

Iran’s internal debate over pursuing a nuclear bomb is intensifying, with hardliners gaining influence during the war, openly questioning long-standing policy

Joseph Duggar case takes new turn; ex-reality star ‘used a blanket to hide, grope 9-year-old’

New details in the Joseph Duggar case reveal he allegedly groped a nine-year-old girl during a family vacation in Florida “under a blanket.”

Israel claims killing of IRGC Navy Chief Alireza Tangsiri linked to Hormuz blockade

“Last night, in a precise and lethal operation, the IDF eliminated the commander of the IRGC Navy, Tangsiri, along with senior naval command officials,” Isr

Noelia Castillo Ramos case update: How did Barcelona rape victim die? Euthanasia details revealed

Noelia Castillo, a 25-year-old Spanish woman at the center of a euthanasia case, died on Thursday

Green Card: 3 things not to do after you become a lawful permanent resident in the US

US News: A whole new set of laws becomes applicable when an individual receives their Green Card in the US after years of waiting and after doing a lot of pape.
spot_img

Related Articles

Popular Categories

spot_imgspot_img