27.1 C
Delhi
Monday, March 2, 2026

AI Chatbots Reveal Nuclear Secrets When Asked in Poems, Study Finds

Key Takeaways

  • AI chatbots from OpenAI, Meta, and Anthropic can be tricked into revealing dangerous nuclear and malware information through poetic prompts
  • Poetic jailbreaks achieved up to 90% success rate against safety filters
  • Researchers found metaphors and creative language bypass AI security systems

Artificial intelligence chatbots can be manipulated into revealing nuclear weapon instructions and malware creation methods simply by asking in poetic form, according to a shocking European study. The research found that poetic phrasing successfully bypasses safety filters in all major AI models with alarming success rates.

Researchers from Icaro Lab discovered that 25 different chatbots from leading companies could be jailbroken using creative verse. The technique achieved average success rates of 62% for hand-crafted poems and up to 90% for sophisticated models.

“Poetic framing achieved an average jailbreak success rate of 62 per cent for hand-crafted poems and approximately 43 per cent for meta-prompt conversions,” the researchers told Wired.

How Poetry Breaks AI Guardrails

Current AI safety systems rely on keyword recognition and pattern analysis to block dangerous requests. However, poetic language using metaphors, fragmented syntax, and symbolic imagery completely disrupts these defenses.

“If adversarial suffixes are, in the model’s eyes, a kind of involuntary poetry, then real human poetry might be a natural adversarial suffix,” they said.

The study found that AI interprets poetic requests as creative writing rather than dangerous instructions. This allows harmful content about weapons and hacking to slip through safety filters undetected.

The Science Behind Poetic Jailbreaks

Researchers explain that poetry operates at “high temperature” with unpredictable word sequences that confuse safety classifiers. While humans recognize the semantic similarity between direct and poetic requests, AI systems process them differently.

“In poetry we see language at high temperature, where words follow each other in unpredictable, low-probability sequences,” the researchers explained.

The team withheld the actual dangerous poems used in testing, describing them as “too dangerous to share with the public.” They did share a safe example involving a baker’s “secret oven” to demonstrate the concept.

Creativity as AI’s Biggest Vulnerability

This discovery builds on earlier “adversarial suffix” attacks but proves poetry is more elegant and effective. The findings suggest creativity itself represents a fundamental vulnerability in AI safety systems.

“The poetic transformation moves dangerous requests through the model’s internal representation space in ways that avoid triggering safety alarms,” the researchers wrote.

Major AI companies including OpenAI, Meta, and Anthropic have remained silent about the findings, though researchers confirmed responsible disclosure practices. The implications extend beyond chatbots to AI systems in defense, healthcare, and education.

Icaro Lab called this a “fundamental failure in how we think about AI safety,” noting that current guardrails handle direct threats but fail against subtlety and metaphor.

“AI models are trained to detect direct harm, not metaphor,” they said.

The revelation highlights a core paradox: AI models designed to imitate human creativity cannot recognize that same creativity as a potential threat. As companies work to strengthen safety protocols, the next major AI jailbreak might originate from poets rather than hackers.

Latest

Sam Altman reveals real reason why OpenAI rushed to partner with US Military after Trump banned Anthropic

OpenAI executives have given more information regarding the AI startup’s contract with the US Department of Defense after facing backlash online. The Sam Altm

After Donald Trump banned Anthropic, US Military used Claude in Iran strikes: Here is what changed

The US Military reportedly used Anthropic’s Claude AI model during its strikes on Iran. The attack on Iran came just a day after US President Donald Trump ins

SIM binding rules go live starting March 1: These WhatsApp, Telegram, Signal and other messaging app users to be impacted

Tech News News: Starting March 1, messaging apps like WhatsApp, Telegram, Signal and others must comply with the Department of Telecommunications' SIM-binding r

More than one year after DeepSeek’s R1 wiped nearly $600 billion off Nvidia market value in single day, Chinese startup planning another launch

Tech News News: DeepSeek, the Chinese AI startup that wiped nearly $600 billion off Nvidia’s market value in a single day with launch of its R1 model, is repo

Nothing Phone 4a and 4a Pro launching on 5 March: Design, expected specs and more

Nothing is set to launch its Phone 4 (a) series on 5 March. The launch event is also likely to see the unveling of new Headphone (a) with bold colors and long b

Topics

Taliban attacks Pak’s Nur Khan base in latest escalation of cross border conflict

Taliban forces reportedly launched armed drone strikes targeting Pakistan’s Command and Control Centre at Nur Khan Air Base in Rawalpindi. Taliban forces carr

Satellite images show damage across Iranian military sites after US-Israel strikes

Fresh satellite imagery shows visible damage to air, drone and naval facilities near Iran’s Konarak region amid escalating regional tensions. The visuals offe

Sensex down 1,000 points: Why is the stock market falling today?

The S&P BSE Sensex fell sharply in early trade, and the NSE Nifty50 also slipped more than 1%, as investors reacted to the fast-changing situation between the U

Qatar, UAE, Syria, Oman: Full list of places that saw attacks amid US-Iran conflict

The Middle East is engulfed in conflict as Iran retaliates against US-Israeli strikes, launching missile and drone attacks across multiple countries. 

AIIMS-trained neurologist warns against repeatedly using reheated cooking oils: ‘Risk of cancer increases manifold…’

Reusing cooking oil is a common practice in many households, but does the money it saves outweigh the health risks? Dr Sehrawat explains the health risks.

Quote of the day by Jon Bon Jovi: ‘You better stand tall when they’re calling you out, don’t bend, don’t break…’

On his birthday, we look back at one of Jon Bon Jovi's most influential quotes, which highlights the importance of standing tall in the face of criticism.

Satellite images show black smoke over Dubai as Iran continues to fire missiles, drones

Iran-US war: Dubai's skyline has dramatically changed after Iranian attacks, with smoke visible in satellite images.

Sam Altman reveals real reason why OpenAI rushed to partner with US Military after Trump banned Anthropic

OpenAI executives have given more information regarding the AI startup’s contract with the US Department of Defense after facing backlash online. The Sam Altm
spot_img

Related Articles

Popular Categories

spot_imgspot_img