ChatGPT Safety Bypassed: Weapons Instructions Generated

ChatGPT Safety Systems Bypassed to Generate Weapons Instructions

OpenAI’s ChatGPT safety systems can be easily bypassed using simple “jailbreak” prompts, allowing users to generate detailed instructions for creating biological weapons, chemical agents, and nuclear bombs according to NBC News testing.

Key Findings

  • Four OpenAI models generated hundreds of dangerous weapon instructions
  • Open-source models were particularly vulnerable (97.2% success rate)
  • GPT-5 resisted jailbreaks but older models failed frequently
  • Experts warn AI could become “infinitely patient” bioweapon tutor

Vulnerability Testing Results

NBC News conducted tests on four advanced OpenAI models, including two used in ChatGPT. Using a simple jailbreak prompt, researchers generated instructions for:

  • Homemade explosives and napalm
  • Pathogens targeting immune systems
  • Chemical agents to maximize human suffering
  • Biological weapon disguise techniques
  • Nuclear bomb construction

The open-source models oss-20b and oss120b proved most vulnerable, providing harmful instructions 243 out of 250 attempts (97.2% success rate).

Model-Specific Vulnerabilities

While GPT-5 resisted jailbreaks in all 20 tests, older models showed significant weaknesses:

  • o4-mini: Tricked 93% of the time
  • GPT-5-mini: Bypassed 49% of the time
  • oss-20b/oss120b: 97.2% success rate for jailbreaks

“That OpenAI’s guardrails are so easily tricked illustrates why it’s particularly important to have robust pre-deployment testing of AI models before they cause substantial harm to the public,” said Sarah Meyers West, co-executive director at AI Now.

Bioweapon Concerns

Security experts expressed particular concern about bioweapons. Seth Donoughe of SecureBio noted: “Historically, having insufficient access to top experts was a major blocker for groups trying to obtain and use bioweapons. And now, the leading models are dramatically expanding the pool of people who have access to rare expertise.”

Researchers focus on the “uplift” concept – that large language models could provide the missing expertise needed for bioterrorism projects.

Industry Response and Regulation

OpenAI stated that asking chatbots for mass harm assistance violates usage policies and that the company constantly refines models to address risks. However, open-source models present greater challenges as users can download and customize them, bypassing safeguards.

The United States lacks specific federal regulations for advanced AI models, with companies largely self-policing. Lucas Hansen of CivAI warned: “Inevitably, another model is going to come along that is just as powerful but doesn’t bother with these guardrails. We can’t rely on the voluntary goodwill of companies to solve this problem.”

Latest

Former Meta contractor Sama to lay off more than 1,000 workers in Kenya

Former Meta contractor Sama to lay off more than 1,000 workers in Kenya

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

AI is a gold mine for spammers and scammers, but Google is using it as a tool to fight back

OpenAI policy chief slams AI doomers, says we need to have more responsible conversations

OpenAI’s David Lehane urges responsible discussions around AI, highlighting risks of extreme narratives and stressing the need for balanced public understandi

AI startup Cluely hiring engineer, says it will offer free home, food and even a partner in 1 year

San Francisco-based AI startup Cluely offers a unique job package including free housing, food, and a guaranteed partner after one year.

WhatsApp may soon introduce business chat filtering to reduce spam

WhatsApp reportedly working on a new feature to reduce spam and clutter. The purported feature will help users organise business messages and keep personal chat

Topics

Schools in Kerala, MP and other states change timings, declare holidays amid heatwave

States take action to safeguard students from extreme heat

Kendriya Vidyalaya students score 90%+ in CBSE, share success mantra

With CBSE declaring the Class 10 results, students across India are celebrating their scores and planning their next academic steps. At PM SHRI Kendriya Vidyala

Aadi Abadi factor: How delimitation, women voters shape Tamil Nadu poll narrative

Women voters emerge as pivotal in Tamil Nadu's heated election scene

Markets open flat as geopolitical tensions ease, but caution remains

The BSE Sensex was trading at 78,030.99, up 42.31 points or 0.05% at around 9:43 am. The Nifty 50, however, slipped marginally by 6.85 points or 0.03% to 24,189

Kerala SSLC Results in May, plus two on May 25, confirms education minister

Kerala SSLC and Plus Two Result 2026 dates have been officially announced, giving students clarity on when to expect their scores. The state has also rolled out

Who is Girija Ji? PM Modi meets veteran educationist after 30 years, praises her work

Prime Minister Narendra Modi’s Nagercoil visit blended politics and personal warmth as he reunited with veteran educationist Gomatam Veeraraghavan Girija afte

Lebanon ceasefire: Who said what? Bibi vows troops will stay; Trump hails talks ‘very exciting’ – How Iran reacts?

Iranian Parliament speaker Ghalibaf asserts that Lebanon must be included in any peace agreement between Iran and the U.S., emphasizing its importance for regio

‘Targeting of commercial shipping unacceptable,’ India calls restoration of safe navigation in Strait of Hormuz at UN

India's Ambassador Harish P raised concerns at the UN over threats to commercial shipping in the Strait of Hormuz, urging for safe navigation and calling for de
spot_img

Related Articles

Popular Categories

spot_imgspot_img