ChatGPT, Gemini and Grok confidently generate dangerous medical advice half the time, study finds

While there has been a lot of debate about the use of AI for healthcare, a new study published in the medical journal BMJ Open has found that around half the advice given by popular AI chatbots is false. The study, first reported by Bloomberg, evaluated five major AI platforms to highlight the growing health risks associated with generative AI platforms.

What did the study find?

The research published this week tested ChatGPT, Gemini, Meta AI, Grok, and DeepSeek and asked each of the chatbots 10 questions across five health categories. Out of the total responses generated, the researchers found that 50 percent contained problematic medical information. Furthermore, the study noted that nearly 20 percent of the generated answers were classified as highly problematic.

The researchers from the US, Canada, and the UK also found that the AI models performed relatively well when handling closed-ended questions concerning established medical topics, such as cancer and vaccines. However, the models struggled significantly to provide safe answers for open-ended queries or complex health subjects like nutrition and stem cells.

A major concern raised in the report is the authoritative tone these models adopt despite lacking clinical judgment or the licences to issue medical diagnoses. The research noted that the AI chatbots delivered answers to the health questions with confidence and certainty even when they could not provide a complete and accurate list of medical references to support their claims.

Out of the tested chatbots across the 10 questions, the researchers say there were only two refusals to answer a question, both of which came from Meta AI.

The authors of the study point out that a major risk for the deployment of these chatbots without proper oversight and public education could lead to them amplifying the spread of misinformation.

“These systems can generate authoritative-sounding but potentially flawed responses,” the researchers explained in the report. They added that the findings “highlight important behavioural limitations and the need to reevaluate how AI chatbots are deployed in public-facing health and medical communication”.

The new study comes at a time when AI companies have been positioning their AI tools to have a bigger say in healthcare. OpenAI launched its ChatGPT Health earlier this year, which allows users to share their personal health data with the popular AI chatbot to receive more grounded results.

Meanwhile, Anthropic also launched Claude for Healthcare, which allows its paid users in the US to securely connect their medical records.

Latest

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots

EU threatens to force Meta to restore WhatsApp full access for rival AI chatbots

OpenAI calls India advanced coding market, Codex users grow 4x in 2 weeks

OpenAI report highlights India as a leading AI market in coding, data analysis, and reasoning, while pointing to gaps in access.

Donald Trump says AI must have kill switch, highlights existential threat to humans

Trump addresses AI risks and benefits in banking sector interview

Redmi K90 Max launch date announced, coming with 8,550mAh battery and Dimensity 9500

Redmi has announced the launch date of the Redmi K90 Max. The device will break cover in China. The company confirms that it will feature a 165Hz refresh rate p

Samsung Galaxy S25 Ultra gets Rs 30,000 price cut, S25 and S25 FE now up to Rs 32,000 cheaper

Samsung reduces prices of Galaxy S25 series smartphones following the launch of Galaxy S26 lineup in India.

Topics

US warns of sanctions on Iran oil buyers as blockade tightens

China remains central to the strategy, having accounted for the bulk of Iran’s seaborne oil exports. US officials say the pressure from the blockade could sig

R Vaishali’s Checkmate: From ‘Pragg’s sister’ to World Championship challenger

Long the other Rameshbabu in a house of genius, the elder sibling has finally pulled rank. By conquering the Candidates in Cyprus, R Vaishali secured a World Ch

Iran, resistance stand as one in war and truce, Tehran warns US of Israel first

Ghalibaf's statement complicates US-Iran ceasefire dynamics

KKR should open with Cameron Green: Aaron Finch on all-rounder’s struggles in IPL 2026

IPL 2026: Cameron Green has struggled after returning from injury and being used out of position, with poor performances in both batting and bowling adding to K

African Catholics recoil as Trump’s Jesus post and spat with Pope deepen tensions

Tensions have been building since late March, when Pope Leo — the first US-born leader of the Catholic Church — began speaking out against the Iran conflict

Not as versatile as Bumrah, but as effective: Behind Josh Hazlewood’s IPL success

Hazlewood's precision with the ball guides RCB to a crucial victory

Iran signals Hormuz shift: Free passage via Oman on table in US talks

The plan signals a softer tone from Tehran after weeks of hardline rhetoric, though uncertainty over mines, access rules and US response keeps the situation fra

About to eliminate: Netanyahu says Israel closing in on Hezbollah stronghold

Tensions escalate as Israeli forces advance on Hezbollah in Lebanon
spot_img

Related Articles

Popular Categories

spot_imgspot_img