OpenAI on Thursday released its latest flagship artificial intelligence (AI) model focused on agentic workflows, GPT-5.5, less than two months after the debut of GPT-5.4.

The new model offers enhanced reasoning, advanced coding capabilities, and the ability to navigate software with minimal human involvement.

“It can look at an unclear problem and figure out just what needs to happen next,” OpenAI President Greg Brockman said. “It feels like it’s setting the foundation for how we’re going to use computers and do computer work going forward.”

Beyond standard text generation, GPT-5.5 excels at debugging code, researching online, and managing complex spreadsheets. Chief Research Officer Mark Chen noted that the model shows “meaningful gains” in scientific workflows, suggesting it could eventually assist experts in fields as specialized as drug discovery.

The increased capability brings heightened scrutiny. OpenAI disclosed that while GPT-5.5 does not cross the critical cybersecurity risk threshold, which would imply a pathway to “severe harm,” it has been classified as “High” risk. This means the model has the potential to amplify existing methods of cyberattacks or digital disruption.

Mia Glaese, vice president of research at OpenAI, said the model underwent rigorous third-party red teaming to test its safety boundaries. The discussion around safety is particularly pointed following rival Anthropic’s recent decision to limit the rollout of its Claude Mythos Preview model due to its prowess in identifying software vulnerabilities.

The release of GPT-5.5 is a clear volley in the escalating arms race between OpenAI, Google, and Anthropic. OpenAI released data claiming that GPT-5.5 consistently outperforms Google’s Gemini 3.1 Pro and Anthropic’s Claude Opus 4.5 across several industry benchmarks.

However, the ambitions for GPT-5.5 extend beyond simple performance scores. Brockman described the model as a stepping stone toward a superapp, a unified, Swiss Army knife program that integrates ChatGPT, the Codex coding engine, and a web browser into a single interface. This vision mirrors the everything app concept championed by Elon Musk for his platform, X, setting the stage for a clash of philosophies over the future of the digital desktop.

Despite the breakneck speed of recent releases, OpenAI Chief Scientist Jakub Pachocki characterized the last two years as “surprisingly slow.” He projected “extremely significant improvements” in the medium term, suggesting that the era of monthly model updates may become the new industry standard.

GPT-5.5 is available today for paid subscribers across Plus, Pro, Business, and Enterprise tiers, with an API rollout expected soon.

“GPT-5.5 delivers some of the strongest security numbers we’ve recorded so far, with low, stable vulnerability density across severities, which is encouraging for teams considering it for production use,” Joe Tyler, lead AI researcher at Sonar, said in an email. “Our evaluation also shows that it produces a lot of code with elevated cognitive complexity, minimal comments, and a noticeable concentration of concurrency bugs. The priority now is making sure review and testing keep pace with the model’s generated code before it reaches production — verification doesn’t go away just because the model gets better.”

TECHSTRONG AI PODCAST

SHARE THIS STORY