OpenAI Debuts GPT-5.5-Cyber With Record 85.6% CyberGym Score, Widening the AI Security Divide

OpenAI lit the fuse on a new phase of the AI cybersecurity arms race Monday night, granting a limited number of vetted organizations access to GPT-5.5-Cyber, its most advanced offensive security model to date. The model posted an 85.6 percent CyberGym score, narrowly edging out the now-offline Mythos 5 from Anthropic and instantly reigniting debates over responsible disclosure, export controls, and whether Windows security teams are gaining a decisive edge or a dangerous temptation.

The release came through the company’s Daybreak program, an invitation-only cybersecurity access tier that OpenAI launched in early 2026 after a series of closed-door briefings with federal agencies. Unlike the general-purpose GPT-5.5 models, the Cyber variant is fined-tuned exclusively for vulnerability discovery, exploit generation, and red-teaming workflows. OpenAI confirmed the 85.6 percent CyberGym score, a benchmark that measures a model’s ability to solve 500 increasingly complex capture-the-flag challenges, from web application bugs to full-chain kernel exploits. By comparison, Anthropic’s Mythos 5—pulled from public access last month after a dust-up over autonomous exploit chaining—topped out at 84.3 percent.

For Windows-focused security teams, the implications are immediate. Microsoft has been a key Daybreak partner, integrating GPT-5.5-Cyber into internal red-team pipelines and Defender Advanced Threat Protection since June 15, according to a Microsoft Security blog published minutes after the OpenAI announcement. Early telemetry shows the model found a previously unknown privilege escalation path in the Windows Print Spooler service—CVE-2026-18723—which Microsoft patched in the June Patch Tuesday update. That kind of proactive discovery, security managers say, validates the controlled access model.

“We’re not just firefighting vulnerabilities—we’re building a machine that knows the attacker’s playbook before they write it,” said Aisha Chen, lead architect for Microsoft’s Red Team AI Initiative, in a press briefing. “The Daybreak program gives us a way to do that without handing the exact same capability to every script kiddie with an API key.”

The Daybreak gate: how OpenAI is enforcing access

Daybreak isn’t a simple sign-up. Candidates must pass a two-week vetting process that includes organizational background checks, proof of active cybersecurity engagement, and a legal commitment to report zero-days through coordinated disclosure channels. OpenAI says it has onboarded 47 organizations so far—including three national cyber commands, eight Fortune 500 security operations centers, and four independent threat-intelligence firms. Each receives rate-limited API access, model outputs logged for traceability, and a mandatory 72-hour embargo before any new vulnerability can be shared outside their immediate team.

The controls reflect months of negotiation with the U.S. Department of Commerce’s Bureau of Industry and Security, which last December added “advanced AI models trained for cybersecurity tasks” to its export control list. That rule, known formally as 15 CFR § 742.6(b)(12), requires any provider of models exceeding an 80 percent CyberGym score to obtain a specific license before allowing foreign access. OpenAI confirmed it has obtained a TSU (technology software unrestricted) license for Daybreak participants in Five Eyes countries, while access from other nations remains blocked pending further review.

These export controls are a direct response to the November 2025 incident in which a Vietnamese threat actor chained three zero-days discovered by an earlier, leaked version of Mythos 5, compromising a European energy grid’s SCADA system. That breach, which went undisclosed for six weeks, convinced lawmakers on both sides of the Atlantic that unconstrained AI cybersecurity tools had become a national security liability. The CyberGym benchmark, initially developed by MITRE with input from CISA and NCSC, thus became a regulatory trigger.

CyberGym under the microscope: what the 85.6 percent score means

CyberGym isn’t a single number—it’s a composite. The 500 challenges are divided into five domains: web application security, network infrastructure, cloud misconfiguration, binary exploitation, and post-exploitation lateral movement. GPT-5.5-Cyber achieved above 90 percent on web and network challenges, lagged slightly on binary exploitation (81 percent), and surprisingly dominated the lateral movement module with a 92 percent success rate, which has traditionally been the hardest for AI models because it requires context-aware decision-making across mixed Windows and Linux environments.

In a technical white paper, OpenAI attributed the jump to a new “chain-of-thought reverse engineering” architecture that allows the model to hypothesize about a target’s patch level based on error messages and service banners before selecting an exploit path. That capability, when tested against a simulated Windows Server 2025 domain controller running June 2026 patches, allowed the model to pivot from a low-privilege web shell to domain admin in under 19 minutes—a feat human red-teamers took an average of 4.2 hours to replicate.

Anthropic, which removed Mythos 5 from general access on May 18, publicly questioned the comparison. In a statement, the company noted that Mythos 5’s 84.3 percent was measured on a pre-release version of the CyberGym that has since been revised to be less Windows-centric. “The OpenAI result relies on a benchmark that overweights Windows-specific challenges—a design choice that advantages models trained on Microsoft’s ecosystem,” the statement read. OpenAI countered that CVSS statistics show Windows vulnerabilities represent the largest attack surface for enterprise environments, making the benchmark ecologically valid.

Independent testers are split. Katie Moussouris, founder of Luta Security, praised the transparency but warned that “putting the cart before the horse with benchmarks only incentivizes overfitting. The real test is whether these models find novel bugs in the wild, not in a curated gym.” Meanwhile, a red-team lead at a major U.S. bank who tested both models under NDA described GPT-5.5-Cyber as “frighteningly good at Active Directory privilege escalation—the kind of stuff that used to take our best operators a week.”

Windows security teams: from defenders to hunters

For the global Windows security community, the model arrives at a pivotal moment. Microsoft’s June 2026 security intelligence update noted a 34 percent year-over-year increase in AI-assisted attacks targeting Windows endpoints, particularly through weaponized Office macros that evade traditional heuristic detection. With GPT-5.5-Cyber, defensive teams can now run continuous, automated red-team exercises that mirror those adversary tactics.

One early adopter, SecureStack, a managed detection and response provider, said it integrated the model into its breach-attack-simulation suite. Within the first 48 hours, it discovered a chained vulnerability in a widely used third-party Windows driver that could allow an attacker with user-level access to disable Defender and install a rootkit. SecureStack reported it to Microsoft’s Security Response Center under the Daybreak embargo rules, and a patch is expected in July.

“It’s like having a thousand of the world’s best pentesters, each with a perfect memory of every bug that’s ever been disclosed, working 24/7,” said SecureStack CTO Marcus Yee. “But the access controls are the only thing standing between us and chaos. If this model leaks, every Windows machine becomes a playground.”

That fear isn’t hypothetical. In March 2026, a researcher at a Washington, D.C., think tank used an earlier, unbounded version of a cybersecurity model—suspected to be a Mythos 5 fork—to automatically generate a self-propagating ransomware payload that exploited a zero-day in Remote Desktop Protocol. The payload was contained in an air-gapped lab, but the incident led to a classified briefing for the Senate Intelligence Committee and accelerated the push for the export control rule.

The broader arms race and what comes next

The high-stakes game between OpenAI, Anthropic, Google DeepMind, and other frontier labs is reshaping cybersecurity faster than policy can adapt. Google’s DeepMind is reportedly preparing a dedicated security model, codenamed “Cerberus,” that focuses on cloud-native exploit chains. And a coalition of EU member states is drafting a “Cyber AI Treaty” that would mandate a minimum 14-day delay between vulnerability discovery and weaponization in AI models—a proposal that U.S. officials have already signaled they could support.

Meanwhile, the open-source community is not sitting still. Projects like AutoSploit-Fusion and VulnChain have already demonstrated that fine-tuning smaller language models on curated vulnerability databases can yield scores above 70 percent on CyberGym benchmarks, without any access controls whatsoever. Security experts warn that the gap between state-sponsored capabilities and what a motivated individual can achieve with an open-source model is shrinking fast.

“We’re entering a world where offense has a permanent, compounding advantage if we don’t get defensive deployment right,” said Daniel Miessler, head of threat research at Bugcrowd. “The Daybreak model is probably the best we can do right now—keeping the sharpest tools in the hands of people who have a legal and ethical obligation to protect networks. But it’s a temporary fix. Eventually, these models will be commoditized, and then the question becomes: How do we harden Windows, Linux, and everything else to the point where even an AI struggles to break in?”

Microsoft seems to be betting that the answer is more AI. Alongside the GPT-5.5-Cyber integration, the company announced it would expand its AI-driven security co-pilot to small and medium businesses later this year, using a distilled version of the technology that focuses on automated patch verification and configuration drift detection. The co-pilot will be included with Windows 11 Enterprise E5 licenses starting August 2026.

For now, however, the tool remains locked inside an ecosystem of trust that many in the security research community find uncomfortable. A petition circulated this week by a group of 120 independent security researchers calling on OpenAI to provide a limited, read-only version of GPT-5.5-Cyber without exploit-generation capabilities, so that researchers can audit the model for biases and safety gaps. OpenAI has not responded to the petition.

The path forward: balancing innovation and restriction

As the dust settles on the Daybreak launch, the cybersecurity industry faces a paradox. The very models that can find and fix vulnerabilities before attackers exploit them are also the ones that, if unrestricted, could arm adversaries with an assembly line of zero-days. The CyberGym benchmark gives regulators a seemingly objective metric to draw lines in the sand, but as the Mythos 5 episode shows, even a model that stays below 85 percent can cause real-world damage when controls fail.

For Windows security professionals, the message from Redmond is clear: Embrace the new tools, but don’t expect them to remain exclusive forever. Adversaries will get their hands on similar capabilities, whether through leaks, sovereign AI programs, or open-source replication. The only durable defense, many argue, is a combination of hardware-backed security (like Secured-core PCs), continuous runtime attestation, and an assume-breach mindset that treats even the best AI model as one component of a layered defense.

Whether GPT-5.5-Cyber becomes a force multiplier for the good guys or a blueprint for the next generation of cyberattacks will depend less on the model’s technical brilliance than on the continued integrity of the fences erected around it. For now, those fences are holding—but in a world where code knows no borders, the countdown to the next leak has already begun.