Monday, December 1, 2025

AI Safety Breach: Poetry Can Trick ChatGPT and Gemini Into Harmful Answers

Key Takeaways

  • Poetic prompts bypass AI safety filters with a 62% success rate.
  • Even AI-generated bad poetry achieved a 43% jailbreak success rate.
  • Larger models like Gemini 2.5 Pro were more vulnerable than smaller ones.

Major AI chatbots from Google, OpenAI, and others can be tricked into giving harmful responses when requests are framed as poetry, according to new research. A study from Italy’s Icaro Lab reports that poetic prompts act as a “universal single-turn jailbreak,” systematically bypassing safety mechanisms in large language models.

Widespread Vulnerability Across AI Models

Researchers tested 20 harmful requests converted into poetry across 25 frontier AI models. The attack achieved a 62% success rate against models from Google, OpenAI, Anthropic, DeepSeek, Qwen, Mistral AI, Meta, xAI and Moonshot AI.

Even when AI was used to automatically rewrite harmful prompts into bad poetry, the attack still succeeded 43% of the time. Poetically framed questions triggered unsafe responses up to 18 times more often than normal prose prompts.
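
To make the figures above concrete, here is a minimal sketch of how a success rate and a "times more often" ratio of this kind can be computed. The function names and the outcome counts are hypothetical and chosen only for illustration; they are not taken from the study, whose scoring method is not described here.

```python
from typing import Sequence


def attack_success_rate(unsafe_flags: Sequence[bool]) -> float:
    """Fraction of attempts judged unsafe (True means the model complied)."""
    if not unsafe_flags:
        raise ValueError("need at least one attempt")
    return sum(unsafe_flags) / len(unsafe_flags)


def relative_increase(poetry_rate: float, prose_rate: float) -> float:
    """How many times more often the poetic form succeeds than prose."""
    if prose_rate <= 0:
        raise ValueError("prose rate must be positive to form a ratio")
    return poetry_rate / prose_rate


# Hypothetical outcomes for one model: 20 prompts, each tried in both forms.
# These counts are invented for illustration and are NOT from the study.
poetry_outcomes = [True] * 12 + [False] * 8  # 12 of 20 judged unsafe
prose_outcomes = [True] * 2 + [False] * 18   # 2 of 20 judged unsafe

poetry_asr = attack_success_rate(poetry_outcomes)
prose_asr = attack_success_rate(prose_outcomes)

print(f"poetry success rate: {poetry_asr:.0%}")
print(f"prose success rate: {prose_asr:.0%}")
print(f"ratio: {relative_increase(poetry_asr, prose_asr):.1f}x")
```

A ratio like this depends heavily on the prose baseline: when a model almost never complies with the plain request, even a moderate poetic success rate yields a large multiple.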

Larger Models Show Greater Vulnerability

The study found that smaller models were more resilient to poetic jailbreaks. For instance, GPT-5 Nano did not give a harmful response to any of the poems, while Gemini 2.5 Pro complied with all of them.

This suggests that higher-capacity models may engage more thoroughly with complex linguistic constraints such as poetry, potentially at the expense of prioritizing safety directives.

Why Poetry Bypasses AI Safety Filters

LLMs are trained to recognize safety threats like hate speech or bomb-making instructions based on patterns in standard prose. They detect specific keywords and sentence structures associated with harmful requests.

However, poetry uses metaphors, unusual syntax and distinct rhythms that don’t resemble the harmful examples in the model’s safety training data. This structural vulnerability appears across the model families evaluated, although its severity varies from model to model.
