GPT-5.5 launched: OpenAI says ChatGPT can now debug code and operate software

Soon after launching its Images 2.0 model, OpenAI has released yet another AI model. The San Francisco-based AI startup announced a major update to ChatGPT as the company debuted its GPT-5.5 model on Thursday, calling it its “smartest and most intuitive” model to date.

What’s new with GPT-5.5?

OpenAI says GPT-5.5 is more efficient in how it works through problems and is capable of reaching higher-quality outputs with fewer tokens and retries.

OpenAI co-founder and President Greg Brockman, in a post on X (formerly Twitter), wrote, “GPT-5.5 is a new class of intelligence. This intelligence makes it intuitive to use; it completes challenging tasks with little micromanagement. Also very token efficient, and runs with low latency and at scale. A real step toward a new way of getting computer work done.”

Agentic coding and software engineering

OpenAI says GPT-5.5 is its strongest agentic coding model yet, capable of handling end-to-end engineering tasks like implementation, refactoring, and debugging.

The company shared various benchmarks to elaborate on the quality of its new model. It noted:

  • On Terminal-Bench 2.0, which tests complex command-line workflows and tool coordination, the model achieved a state-of-the-art accuracy of 82.7%.
  • On SWE-Bench Pro, which evaluates real-world GitHub issue resolution, it reached 58.6%, solving more tasks in a single pass than its predecessors.

OpenAI also says that early testers noted the model possesses stronger conceptual clarity, capable of understanding the broader shape of a system and successfully navigating ambiguous failures.

A co-scientist

Beyond coding, OpenAI claims the new model fundamentally changes how knowledge work and scientific research are done. The company says that since its new AI is better at understanding intent, it moves more naturally through the full loop of finding information: using tools, checking the output, and turning raw material into something useful.

  • The model scored 84.9% on GDPval, which tests knowledge work across 44 occupations, and 78.7% on OSWorld-Verified for operating real computer environments.
  • In scientific applications, the model achieved 80.5% on BixBench, a benchmark designed for real-world bioinformatics and data analysis.

OpenAI also highlighted that an internal version of GPT-5.5 even helped discover a new mathematical proof regarding Ramsey numbers, a complex area of combinatorics that studies how order inevitably emerges in large enough systems.

Cybersecurity protection:

Owing to the improvements in its new model, OpenAI says it has designed tighter controls around higher-risk activity, sensitive cyber requests, and added protections for repeated misuse.

“With GPT-5.5, we are ensuring developers can secure their code with ease, while putting stronger controls around the cyber workflows most likely to cause harm by malicious actors,” OpenAI said.

Notably, OpenAI’s chief rival Anthropic had recently refused to unveil its Mythos AI model owing to the advanced cybersecurity risks it posed.

OpenAI has also launched a “Trusted Access for Cyber” program which allows verified organisations defending critical infrastructure to access cyber-permissive models with fewer restrictions.

“This gives a wide range of verified defenders more capable tools for legitimate security work with less unnecessary friction to ensure we democratise access to important defensive capabilities,” the company wrote in its blog post.

Benchmark (Category) GPT-5.5 GPT-5.4 Claude Opus 4.7 Gemini 3.1 Pro
Terminal-Bench 2.0 (Agentic Coding) 82.7% 75.1% 69.4% 68.5%
SWE-Bench Pro (Real-world Coding) 58.6% 57.7% 64.3% 54.2%
Expert-SWE (Internal Coding Eval) 73.1% 68.5%
GDPval (Professional Knowledge Work) 84.9% 83.0% 80.3% 67.3%
FinanceAgent v1.1 (Professional) 60.0% 56.0% 64.4% 59.7%
OSWorld-Verified (Computer Use) 78.7% 75.0% 78.0%
BrowseComp (Tool Use) 84.4% 82.7% 79.3% 85.9%
GeneBench (Academic/Biology) 25.0% 19.0%
BixBench (Bioinformatics) 80.5% 74.0%
FrontierMath Tier 1–3 (Academic Math) 51.7% 47.6% 43.8% 36.9%
GPQA Diamond (Academic) 93.6% 92.8% 94.2% 94.3%
CyberGym (Cybersecurity) 81.8% 79.0% 73.1%
ARC-AGI-1 (Abstract Reasoning) 95.0% 93.7% 93.5% 98.0%

How to use GPT-5.5?

OpenAI says GPT-5.5 is currently rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex. Meanwhile, the more advanced GPT-5.5 Pro model is also rolling out to Pro, Business, and Enterprise ChatGPT users.

The company did not reveal when the new models will be arriving for free and Go users.

Latest

AI smart glasses will help visually impaired runners take on the London Marathon

AI smart glasses will help visually impaired runners take on the London Marathon

You can now ask ChatGPT to find cheap flights with the new Skyscanner integration: step-by-step guide

Skyscanner has launched its app within ChatGPT allowing users in India and globally to search for flights using conversational prompts inside the chatbot

Did Anthropic ‘dumb down’ Claude Code? Post-mortem reveals the three bugs that crippled performance

Anthropic has acknowledged complaints regarding Claude Code's performance, attributing issues to three updates that affected coding quality.

IPhone 18 Pro, iPhone 18 Pro Max and iPhone Ultra complete design changes revealed in new leak

A new leak has via iPhone dummy models has revealed the designs of the iPhone 18 pro, iPhone 18 Pro Max and iPhone Ultra.

DeepSeek is back: China’s AI claims to surpass ChatGPT and Gemini in key benchmarks

DeepSeek has introduced its new DeepSeek-V4 AI models, comprising Pro and Flash versions. The new model claims to compete with ChatGPT and Gemini in many key be

Topics

Michael Box Office Collection: Jaafar Jackson film breaks records with $12.6M US previews despite poor reviews

Lionsgate's Michael Jackson biopic 'Michael' is heading for a record-breaking opening weekend with $12.6 million in US previews and $18.5 million internationall

Khal Nayak is back: Sanjay Dutt unveils teaser, revives iconic role in new Jio Studios film

Sanjay Dutt and Aksha Kamboj have acquired rights to the 1993 film Khal Nayak, with Jio Studios set to produce a new project. The move signals a revival of the

AI smart glasses will help visually impaired runners take on the London Marathon

AI smart glasses will help visually impaired runners take on the London Marathon

Iran’s FM Abbas Araghchi to visit Pakistan, confirms Iranian state media

The US logistics and security team have already reached Islamabad, Reuters reported citing government sources.

Explained: Why Iran is not ready to compromise with US despite pressure

US-Iran conflict: Tensions between Washington and Tehran remain on edge as diplomatic efforts to secure a truce show no signs of progress. Earlier this week,

Situation in Iran remains serious, Embassy providing assistance to Indian nationals: MEA

Earlier this week, US President Donald Trump unilaterally extended the ceasefire with Iran indefinitely, hours before it was to expire, even though Tehran refus

No China, no Gulf, Dhurandhar 2’s box office supremacy still unabated after 5 weeks

Dhurandhar: The Revenge has completed 36 days in theatres and is closing in on Baahubali 2's global total. Its run stands out because it has crossed Rs 1,766 cr

Raghav Chadha along with two other Rajya Sabha MPs officially join BJP

Earlier today, Raghav Chadha held a press conference along with other Rajya Sabha MPs, where he announced his resignation from the AAP. Additionally, he also an
spot_img

Related Articles

Popular Categories

spot_imgspot_img