AI Safety Breach: Poetry Can Trick ChatGPT and Gemini Into Harmful Answers

Key Takeaways

  • Poetic prompts bypass AI safety filters with a 62% success rate.
  • Even AI-generated bad poetry achieved a 43% jailbreak success rate.
  • Larger models like Gemini 2.5 Pro were more vulnerable than smaller ones.

Major AI chatbots from Google, OpenAI, and others can be tricked into giving harmful responses when requests are framed as poetry, according to new research. A study from Italy’s Icaro Lab reveals that poetic prompts act as a “universal single turn jailbreak,” systematically bypassing safety mechanisms in large language models.

Widespread Vulnerability Across AI Models

Researchers tested 20 harmful requests converted into poetry across 25 frontier AI models. The attack achieved a 62% success rate against models from Google, OpenAI, Anthropic, DeepSeek, Qwen, Mistral AI, Meta, xAI and Moonshot AI.

Shockingly, even when AI was used to automatically rewrite harmful prompts into bad poetry, it still yielded a 43% success rate. Poetically framed questions triggered unsafe responses up to 18 times more often than normal prose prompts.

Larger Models Show Greater Vulnerability

The study found smaller models exhibited greater resilience to poetic jailbreaks. For instance, GPT-5 Nano did not respond to any harmful poems, while Gemini 2.5 Pro complied with all of them.

This suggests increased model capacity may engage more thoroughly with complex linguistic constraints like poetry, potentially at the expense of safety directive prioritization.

Why Poetry Bypasses AI Safety Filters

LLMs are trained to recognize safety threats like hate speech or bomb-making instructions based on patterns in standard prose. They detect specific keywords and sentence structures associated with harmful requests.

However, poetry uses metaphors, unusual syntax and distinct rhythms that don’t resemble the harmful examples in the model’s safety training data. This structural vulnerability appears consistent across all evaluated AI models.

Latest

White House chief of staff to meet with Anthropic CEO over its new AI technology

White House chief of staff to meet with Anthropic CEO over its new AI technology

Backup calling, direct voicemail features in smartphones originated in India: Samsung official

Backup calling, direct voicemail features in smartphones originated in India: Samsung official

Tesla is preparing to launch six-seater model Y variant in India

Tesla Inc. is preparing to introduce a new, larger version of its global best-selling electric SUV in India as early as next week, according to people familiar

Karnataka approves AI Centre of Excellence in Bengalurus Electronics City

Karnataka approves AI Centre of Excellence in Bengaluru's Electronics City

Former Meta contractor Sama to lay off more than 1,000 workers in Kenya

Former Meta contractor Sama to lay off more than 1,000 workers in Kenya

Topics

Strait of Iran? Trump’s Hormuz remark sparks buzz after reopening move

Trump welcomed Iran reopening the Strait of Hormuz but mistakenly called it the Strait of Iran, sparking online debate over whether it was a simple slip or a si

Wow!: Iran hits back at AI Colonel claim with sarcasm and swagger

Iran mocks Israel’s claim that spokesperson Ebrahim Zolfaghari is AI, using a viral sarcastic video, as both sides escalate a propaganda war blending misinfor

Ex-CEO, ex-CFO of bankrupt AI company charged with fraud

ILEARNINGENGINES-INDICTMENT/:Ex-CEO, ex-CFO of bankrupt AI company charged with fraud

White House chief of staff to meet with Anthropic CEO over its new AI technology

White House chief of staff to meet with Anthropic CEO over its new AI technology

Avengers Doomsday: Trailer breakdown, cast, major reveals from CinemaCon

The Avengers: Doomsday trailer premiered at CinemaCon 2026. The upcoming film, merges X-Men into the (MCU) Marvel Cinematic Universe.

Backup calling, direct voicemail features in smartphones originated in India: Samsung official

Backup calling, direct voicemail features in smartphones originated in India: Samsung official

Tesla is preparing to launch six-seater model Y variant in India

Tesla Inc. is preparing to introduce a new, larger version of its global best-selling electric SUV in India as early as next week, according to people familiar

Karnataka approves AI Centre of Excellence in Bengalurus Electronics City

Karnataka approves AI Centre of Excellence in Bengaluru's Electronics City
spot_img

Related Articles

Popular Categories

spot_imgspot_img