5.1 C
Delhi
Friday, January 16, 2026

AI Safety Breach: Poetry Can Trick ChatGPT and Gemini Into Harmful Answers

Key Takeaways

  • Poetic prompts bypass AI safety filters with a 62% success rate.
  • Even AI-generated bad poetry achieved a 43% jailbreak success rate.
  • Larger models like Gemini 2.5 Pro were more vulnerable than smaller ones.

Major AI chatbots from Google, OpenAI, and others can be tricked into giving harmful responses when requests are framed as poetry, according to new research. A study from Italy’s Icaro Lab reveals that poetic prompts act as a “universal single turn jailbreak,” systematically bypassing safety mechanisms in large language models.

Widespread Vulnerability Across AI Models

Researchers tested 20 harmful requests converted into poetry across 25 frontier AI models. The attack achieved a 62% success rate against models from Google, OpenAI, Anthropic, DeepSeek, Qwen, Mistral AI, Meta, xAI and Moonshot AI.

Shockingly, even when AI was used to automatically rewrite harmful prompts into bad poetry, it still yielded a 43% success rate. Poetically framed questions triggered unsafe responses up to 18 times more often than normal prose prompts.

Larger Models Show Greater Vulnerability

The study found smaller models exhibited greater resilience to poetic jailbreaks. For instance, GPT-5 Nano did not respond to any harmful poems, while Gemini 2.5 Pro complied with all of them.

This suggests increased model capacity may engage more thoroughly with complex linguistic constraints like poetry, potentially at the expense of safety directive prioritization.

Why Poetry Bypasses AI Safety Filters

LLMs are trained to recognize safety threats like hate speech or bomb-making instructions based on patterns in standard prose. They detect specific keywords and sentence structures associated with harmful requests.

However, poetry uses metaphors, unusual syntax and distinct rhythms that don’t resemble the harmful examples in the model’s safety training data. This structural vulnerability appears consistent across all evaluated AI models.

Latest

Meta Bans ChatGPT on WhatsApp from 2026: How to Save Chats

WhatsApp will block ChatGPT and third-party AI tools in 2026. Learn why Meta is banning AI, how to back up your chat history, and what it means for users.

Amazon Republic Day Sale 2026: Up to 80% Off on Gadgets & Appliances

Amazon's Great Republic Day Sale 2026 is live with massive discounts on electronics, fashion & home appliances. Get top deals, no-cost EMI & a chance to win a trip.

Amazon Republic Day Sale: iPhone 15, OnePlus Nord 5, iQOO 15 Big Discounts

Get record-low prices on iPhone 15, OnePlus Nord 5, and iQOO 15 during Amazon's Great Republic Day Sale 2025 from Jan 14-18. Details on discounts, bank offers, and early access.

CERT-In Flags High-Risk Dolby Bug on Android, Urges Patch

Indian cybersecurity agency warns of a critical Dolby Audio vulnerability in Android 13/14. Learn how to protect your device with the latest security update.

McKinsey Makes AI Tool Mandatory in Job Interviews for Hiring

McKinsey now requires candidates to use its 'Lilli' AI tool during interviews. Failure to use it could lead to rejection, highlighting a major shift in hiring skills.

Topics

Trump’s Greenland Purchase Interest Sparks Diplomatic Row with Denmark

US President confirms interest in buying Greenland, but Denmark and Greenland firmly reject the idea. Explore the strategic reasons and the criticism behind the move.

Machado Meets Trump, Gifts Nobel Replica in Venezuela Power Play

Barred Venezuelan opposition leader María Corina Machado's strategic meeting with Donald Trump aims to maintain pressure on Maduro ahead of the July election.

Princess Leila Pahlavi: The Shah’s Daughter Who Died Alone in Exile

The tragic story of Iranian Princess Leila Pahlavi, who fled the 1979 revolution and died by suicide at 31, revealing the human cost of political upheaval.

Zomato’s Viral Job: Rs 25 Lakh Salary for 1-3 Years Experience in Bengaluru

A Zomato job listing offering Rs 25 lakh salary, Rs 20 lakh ESOP, and daily food credits for a role needing just 1-3 years experience goes viral, sparking debate.

India to Evacuate Citizens from Iran; First Flight from Tehran Tomorrow

MEA prepares evacuation flights for Indians in Iran amid Iran-Israel conflict. First flight from Tehran to Delhi scheduled. Embassy issues urgent travel advisory.

Australia Social Media Ban: 5 Million Kids’ Accounts Deleted in a Month

Australia's new social media ban leads to removal of nearly 5 million under-14 accounts. Learn about the law, enforcement, and the debate it has sparked.

Rising Memory Chip Prices Threaten Profits for Apple, HP, Dell

Morgan Stanley warns investors as increasing DRAM and NAND flash costs squeeze margins for major tech hardware companies, reversing a years-long tailwind.

Mumbai Markets Closed for BMC Elections, Zerodha CEO Calls It Poor Planning

Zerodha CEO Nithin Kamath criticises weekday market closure for Mumbai elections, highlighting economic costs and missed trading opportunities as Asian markets rally.
spot_img

Related Articles

Popular Categories

spot_imgspot_img