10.1 C
Delhi
Monday, December 1, 2025

AI Safety Breach: Poetry Can Trick ChatGPT and Gemini Into Harmful Answers

Key Takeaways

  • Poetic prompts bypass AI safety filters with a 62% success rate.
  • Even AI-generated bad poetry achieved a 43% jailbreak success rate.
  • Larger models like Gemini 2.5 Pro were more vulnerable than smaller ones.

Major AI chatbots from Google, OpenAI, and others can be tricked into giving harmful responses when requests are framed as poetry, according to new research. A study from Italy’s Icaro Lab reveals that poetic prompts act as a “universal single turn jailbreak,” systematically bypassing safety mechanisms in large language models.

Widespread Vulnerability Across AI Models

Researchers tested 20 harmful requests converted into poetry across 25 frontier AI models. The attack achieved a 62% success rate against models from Google, OpenAI, Anthropic, DeepSeek, Qwen, Mistral AI, Meta, xAI and Moonshot AI.

Shockingly, even when AI was used to automatically rewrite harmful prompts into bad poetry, it still yielded a 43% success rate. Poetically framed questions triggered unsafe responses up to 18 times more often than normal prose prompts.

Larger Models Show Greater Vulnerability

The study found smaller models exhibited greater resilience to poetic jailbreaks. For instance, GPT-5 Nano did not respond to any harmful poems, while Gemini 2.5 Pro complied with all of them.

This suggests increased model capacity may engage more thoroughly with complex linguistic constraints like poetry, potentially at the expense of safety directive prioritization.

Why Poetry Bypasses AI Safety Filters

LLMs are trained to recognize safety threats like hate speech or bomb-making instructions based on patterns in standard prose. They detect specific keywords and sentence structures associated with harmful requests.

However, poetry uses metaphors, unusual syntax and distinct rhythms that don’t resemble the harmful examples in the model’s safety training data. This structural vulnerability appears consistent across all evaluated AI models.

Latest

Mint AI Tech4Good Awards 2025: Celebrating Transformative AI Solutions

Discover how AI innovations are driving social impact across disabilities, sustainability, education and healthcare with measurable results from India's leading Tech4Good awards.

Agentic AI Strategy: CIO Guide to $6T Digital Labor Market

Learn how CIOs can overcome agentic AI challenges with strategic frameworks for ROI, data integration, and human-AI collaboration in the evolving digital landscape.

OnePlus Pad Go 2 India Launch: Price, Specs & 5G Support

OnePlus Pad Go 2 launches Dec 17 with stylus support, 5G connectivity and OxygenOS 16. Get expected price, specs and key features details.

Elon Musk Reveals Why He Stopped Playing Grand Theft Auto

Tesla CEO Elon Musk explains his moral objection to killing police in GTA games during Nikhil Kamath podcast interview.

Google Cuts Free Gemini AI Usage Limits Due to High Demand

Free daily prompts for Gemini 3 Pro and Nano Banana Pro have been reduced. Learn the new limits and how this affects your AI usage.

Topics

China’s Mega Dam on Yarlung Zangbo Raises Water Security Concerns

China begins construction of massive dam on river flowing to India and Bangladesh, threatening water security for 1.3 billion people downstream.

Inheritance Tax Changes Threaten Family Farms and Businesses

Chancellor Rachel Reeves faces backlash as new inheritance tax rules could force rural businesses to close, risking 200,000 jobs and £15bn economic impact.

F-35 Stealth Fighter: How America Controls Global Air Power Strategy

Discover how the F-35 Lightning II combines stealth technology with diplomatic leverage to reshape military alliances and maintain US air dominance worldwide.

Elon Musk Reveals Partner Shivon Zilis Has Indian Heritage

Elon Musk discloses his partner's Indian roots and their son's middle name honoring Nobel laureate Chandrasekhar in exclusive podcast interview.

Thailand Accelerates Net-Zero Target to 2050, Overhauls Energy Policy

Thailand brings net-zero target forward by 15 years, requiring major energy transformation including renewable scale-up and nuclear power adoption.

Elon Musk Reveals Partner Shivon Zilis’s Indian Heritage, Son’s Name

Elon Musk shares that partner Shivon Zilis is half-Indian and their son's middle name honors Nobel physicist Chandrasekhar in exclusive podcast revelations.

Zootopia 2 Shatters Records with $556M Historic Animated Debut

Disney's Zootopia 2 achieves largest animated opening ever and fourth-biggest global launch in film history with massive $556 million debut weekend.

Thailand Early Election: Political Crisis Amid Flood Disaster

Prime Minister Anutin faces early dissolution pressure as southern floods and opposition threats create perfect political storm for Thailand's economy.
spot_img

Related Articles

Popular Categories

spot_imgspot_img