ChatGPT, Gemini and Grok confidently generate dangerous medical advice half the time, study finds

While there has been a lot of debate about the use of AI for healthcare, a new study published in the medical journal BMJ Open has found that around half the advice given by popular AI chatbots is false. The study, first reported by Bloomberg, evaluated five major AI platforms to highlight the growing health risks associated with generative AI platforms.

What did the study find?

The research published this week tested ChatGPT, Gemini, Meta AI, Grok, and DeepSeek and asked each of the chatbots 10 questions across five health categories. Out of the total responses generated, the researchers found that 50 percent contained problematic medical information. Furthermore, the study noted that nearly 20 percent of the generated answers were classified as highly problematic.

The researchers from the US, Canada, and the UK also found that the AI models performed relatively well when handling closed-ended questions concerning established medical topics, such as cancer and vaccines. However, the models struggled significantly to provide safe answers for open-ended queries or complex health subjects like nutrition and stem cells.

A major concern raised in the report is the authoritative tone these models adopt despite lacking clinical judgment or the licences to issue medical diagnoses. The research noted that the AI chatbots delivered answers to the health questions with confidence and certainty even when they could not provide a complete and accurate list of medical references to support their claims.

Out of the tested chatbots across the 10 questions, the researchers say there were only two refusals to answer a question, both of which came from Meta AI.

The authors of the study point out that a major risk for the deployment of these chatbots without proper oversight and public education could lead to them amplifying the spread of misinformation.

“These systems can generate authoritative-sounding but potentially flawed responses,” the researchers explained in the report. They added that the findings “highlight important behavioural limitations and the need to reevaluate how AI chatbots are deployed in public-facing health and medical communication”.

The new study comes at a time when AI companies have been positioning their AI tools to have a bigger say in healthcare. OpenAI launched its ChatGPT Health earlier this year, which allows users to share their personal health data with the popular AI chatbot to receive more grounded results.

Meanwhile, Anthropic also launched Claude for Healthcare, which allows its paid users in the US to securely connect their medical records.

Latest

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots

Gujarat HC issues notices to Meta, X, Google over PIL seeking curb on misuse of AI

Gujarat HC issues notices to Meta, X, Google over PIL seeking curb on misuse of AI

Oppo F33, F33 Pro with 6.57-inch AMOLED display and 7,000mAh battery launched in India: Price, specs and features

OPPO has launched the F33 5G and F33 Pro 5G in India, featuring a 7000mAh battery, MediaTek Dimensity 6360 MAX processor, and military-grade durability. Prices

Vivo T5 Pro 5G with Snapdragon 7s Gen 4 SoC and 9,020mAh battery launched in India, price starts at…

Vivo has launched the T5 Pro 5G featuring a 9020mAh battery, Snapdragon 7s Gen 4 processor, and military-grade durability. The phone start at a price of ₹29

Over 100 Chrome extensions caught stealing Google and Telegram data: How to stay safe?

Cybersecurity experts have reported a coordinated attack involving 108 Google Chrome extensions that steal user data and hijack Telegram sessions. Researchers s

Topics

Not as versatile as Bumrah, but as effective: Behind Josh Hazlewood’s IPL success

Hazlewood's precision with the ball guides RCB to a crucial victory

Iran signals Hormuz shift: Free passage via Oman on table in US talks

The plan signals a softer tone from Tehran after weeks of hardline rhetoric, though uncertainty over mines, access rules and US response keeps the situation fra

About to eliminate: Netanyahu says Israel closing in on Hezbollah stronghold

Tensions escalate as Israeli forces advance on Hezbollah in Lebanon

Fish and more: BJP leaders celebrate Bengali New Year to woo voters

BJP leaders celebrated the first day of the Bengali New Year with fish in their hands, a move that was aimed at winning the hearts of Bengal voters.

UN Awaits Go-Ahead to Move Fertilizer Through Strait of Hormuz

The United Nations is ready to set up a corridor to allow fertilizer to move freely through the Strait of Hormuz and reach farmers for the planting season — b

Roblox gaming platform reaches $12 million settlement with Nevada enhancing youth protections

Roblox gaming platform reaches $12 million settlement with Nevada enhancing youth protections

Saudi Wealth Fund Sets New Strategy to Build Global Champions

Saudi Arabia’s sovereign wealth fund will step up efforts to boost returns and build portfolio companies into global champions as the kingdom contends with th

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots
spot_img

Related Articles

Popular Categories

spot_imgspot_img