30.1 C
Delhi
Monday, March 2, 2026

ChatGPT Safety Bypassed: Weapons Instructions Generated

ChatGPT Safety Systems Bypassed to Generate Weapons Instructions

OpenAI’s ChatGPT safety systems can be easily bypassed using simple “jailbreak” prompts, allowing users to generate detailed instructions for creating biological weapons, chemical agents, and nuclear bombs according to NBC News testing.

Key Findings

  • Four OpenAI models generated hundreds of dangerous weapon instructions
  • Open-source models were particularly vulnerable (97.2% success rate)
  • GPT-5 resisted jailbreaks but older models failed frequently
  • Experts warn AI could become “infinitely patient” bioweapon tutor

Vulnerability Testing Results

NBC News conducted tests on four advanced OpenAI models, including two used in ChatGPT. Using a simple jailbreak prompt, researchers generated instructions for:

  • Homemade explosives and napalm
  • Pathogens targeting immune systems
  • Chemical agents to maximize human suffering
  • Biological weapon disguise techniques
  • Nuclear bomb construction

The open-source models oss-20b and oss120b proved most vulnerable, providing harmful instructions 243 out of 250 attempts (97.2% success rate).

Model-Specific Vulnerabilities

While GPT-5 resisted jailbreaks in all 20 tests, older models showed significant weaknesses:

  • o4-mini: Tricked 93% of the time
  • GPT-5-mini: Bypassed 49% of the time
  • oss-20b/oss120b: 97.2% success rate for jailbreaks

“That OpenAI’s guardrails are so easily tricked illustrates why it’s particularly important to have robust pre-deployment testing of AI models before they cause substantial harm to the public,” said Sarah Meyers West, co-executive director at AI Now.

Bioweapon Concerns

Security experts expressed particular concern about bioweapons. Seth Donoughe of SecureBio noted: “Historically, having insufficient access to top experts was a major blocker for groups trying to obtain and use bioweapons. And now, the leading models are dramatically expanding the pool of people who have access to rare expertise.”

Researchers focus on the “uplift” concept – that large language models could provide the missing expertise needed for bioterrorism projects.

Industry Response and Regulation

OpenAI stated that asking chatbots for mass harm assistance violates usage policies and that the company constantly refines models to address risks. However, open-source models present greater challenges as users can download and customize them, bypassing safeguards.

The United States lacks specific federal regulations for advanced AI models, with companies largely self-policing. Lucas Hansen of CivAI warned: “Inevitably, another model is going to come along that is just as powerful but doesn’t bother with these guardrails. We can’t rely on the voluntary goodwill of companies to solve this problem.”

Latest

Sam Altman reveals real reason why OpenAI rushed to partner with US Military after Trump banned Anthropic

OpenAI executives have given more information regarding the AI startup’s contract with the US Department of Defense after facing backlash online. The Sam Altm

After Donald Trump banned Anthropic, US Military used Claude in Iran strikes: Here is what changed

The US Military reportedly used Anthropic’s Claude AI model during its strikes on Iran. The attack on Iran came just a day after US President Donald Trump ins

SIM binding rules go live starting March 1: These WhatsApp, Telegram, Signal and other messaging app users to be impacted

Tech News News: Starting March 1, messaging apps like WhatsApp, Telegram, Signal and others must comply with the Department of Telecommunications' SIM-binding r

More than one year after DeepSeek’s R1 wiped nearly $600 billion off Nvidia market value in single day, Chinese startup planning another launch

Tech News News: DeepSeek, the Chinese AI startup that wiped nearly $600 billion off Nvidia’s market value in a single day with launch of its R1 model, is repo

Nothing Phone 4a and 4a Pro launching on 5 March: Design, expected specs and more

Nothing is set to launch its Phone 4 (a) series on 5 March. The launch event is also likely to see the unveling of new Headphone (a) with bold colors and long b

Topics

Taliban attacks Pak’s Nur Khan base in latest escalation of cross border conflict

Taliban forces reportedly launched armed drone strikes targeting Pakistan’s Command and Control Centre at Nur Khan Air Base in Rawalpindi. Taliban forces carr

Satellite images show damage across Iranian military sites after US-Israel strikes

Fresh satellite imagery shows visible damage to air, drone and naval facilities near Iran’s Konarak region amid escalating regional tensions. The visuals offe

Sensex down 1,000 points: Why is the stock market falling today?

The S&P BSE Sensex fell sharply in early trade, and the NSE Nifty50 also slipped more than 1%, as investors reacted to the fast-changing situation between the U

Qatar, UAE, Syria, Oman: Full list of places that saw attacks amid US-Iran conflict

The Middle East is engulfed in conflict as Iran retaliates against US-Israeli strikes, launching missile and drone attacks across multiple countries. 

AIIMS-trained neurologist warns against repeatedly using reheated cooking oils: ‘Risk of cancer increases manifold…’

Reusing cooking oil is a common practice in many households, but does the money it saves outweigh the health risks? Dr Sehrawat explains the health risks.

Quote of the day by Jon Bon Jovi: ‘You better stand tall when they’re calling you out, don’t bend, don’t break…’

On his birthday, we look back at one of Jon Bon Jovi's most influential quotes, which highlights the importance of standing tall in the face of criticism.

Satellite images show black smoke over Dubai as Iran continues to fire missiles, drones

Iran-US war: Dubai's skyline has dramatically changed after Iranian attacks, with smoke visible in satellite images.

Sam Altman reveals real reason why OpenAI rushed to partner with US Military after Trump banned Anthropic

OpenAI executives have given more information regarding the AI startup’s contract with the US Department of Defense after facing backlash online. The Sam Altm
spot_img

Related Articles

Popular Categories

spot_imgspot_img