OpenAI released its most ambitious Codex update yet on May 29, 2026, bringing full Computer Use capabilities to Windows 11. The new mode transforms Codex from a chat-bound assistant into an agent that can directly manipulate local applications, files, terminals, and developer tools—all under real-time user supervision.

Announced via a joint blog post with Microsoft, Codex v2.5 for Windows taps into the native Windows Copilot Runtime and a hybrid AI architecture that mixes cloud-based models with on-device processing. The result is a coding agent that not only generates code but also executes multi-step workflows across IDEs, browsers, command prompts, and system settings.

“This is the biggest leap in developer tooling since the integrated debugger,” said Mira Murati, OpenAI’s CTO, during a press briefing. “Computer Use closes the gap between intent and action—Codex can now set up your environment, run tests, commit changes, and even refile a bug, all while you watch.”

What Computer Use Mode Actually Does

The core idea is agentic execution. Instead of pasting snippets into a separate IDE, developers give Codex a high-level task—say, “clone the repo, install dependencies, fix the linting errors in the utils folder, and open a pull request”—and the agent carries it out. Codex sees the screen, parses UI elements, and takes control of the keyboard and mouse when authorized.

During internal trials with 500 early-access developers, common workflows included:
- Automating local build-test-debug loops across VSCode, JetBrains, and terminal emulators.
- Navigating Windows File Explorer to organize project assets.
- Editing Office documents or Notion pages consumed by a project’s documentation pipeline.
- Logging into GitHub, GitLab, or Azure DevOps to create branches and PRs.
- Running PowerShell scripts to configure services, manage Docker containers, or toggle Windows features.

Each action appears in a translucent “supervision panel” that overlays the desktop. The user must explicitly approve any step that changes code, accesses the internet, or modifies system settings. By default, read-only operations (such as opening a file or reading a directory) are automatically allowed, but the permission model is fully customizable.

How the Windows Integration Works

Codex’s Computer Use mode relies on two Windows 11 subsystems that Microsoft has been refining since 2024. The first is the Windows Copilot Runtime, which exposes a set of APIs for AI models to interact with the OS, including screen understanding, accessibility tree parsing, and secure input injection. The second is the Local AI Accelerator, a driver stack optimized for NPU-offloaded vision and language models on devices with Qualcomm Snapdragon X, Intel Core Ultra, or AMD Ryzen AI processors.

When a developer types a task, Codex breaks it into a plan, sends visual state to a fine-tuned vision transformer that identifies windows, buttons, and text fields, then combines that with the accessibility tree to locate exact UI coordinates. This hybrid approach avoids brittle pixel-based macro recording. Instead, the agent recognizes controls semantically—it can click the “Run” button even if the window is resized or themed differently.

“We built this on top of the UI Automation framework that already powers Narrator and other assistive tools,” explained Panos Panay, Microsoft’s Chief Product Officer. “That means Codex works with any app that follows Windows accessibility guidelines—which covers most developer tools and mainstream productivity software.”

Security, Privacy, and Supervision

Unsupervised desktop automation raises obvious red flags. OpenAI and Microsoft engineered four layers of safeguards:

  1. Persistent supervision panel – Every action Codex takes is previewed in a floating, non-dismissable overlay. Critical actions (file deletion, registry changes, network access to new domains) require a two-factor confirmation: clicking “Approve” and entering a Windows Hello biometric check.

  2. Runspace isolation – By default, Computer Use runs inside a lightweight sandbox called a Codex Runspace. It’s a container that has access only to designated directories, environment variables, and network endpoints. Developers can expand its scope on a per-project basis, but the OS warns against removing all restrictions.

  3. Local-first processing – The vision model that interprets the screen runs entirely on-device via the NPU. Only the language model queries go to Azure OpenAI services, and even those can be routed through an enterprise’s private endpoint. Microsoft confirmed that no screenshots or UI snapshots leave the machine without explicit encryption and user consent.

  4. Audit trails – Every Computer Use session generates a step-by-step log with screenshots, accessible from the Codex dashboard. Enterprises can pipe these logs into Microsoft Purview for compliance reviews.

Nathaniel Whittemore, an independent AI security researcher, called the sandboxing “a solid first attempt” but warned that determined attackers could still exploit prompt-injection if the agent processes untrusted web content. “The risk is not that Codex goes rogue—it’s that a malicious HTML file could trick it into executing dangerous commands,” he wrote in his newsletter. OpenAI acknowledged the concern and said it will ship with a domain allow-list and a strict content-security policy that strips executable scripts from rendered pages before the vision model analyzes them.

System Requirements and Hardware Compatibility

Computer Use mode demands a Copilot+ PC or a system with a discrete NPU rated at 40+ TOPS. As of mid-2026, that includes:
- All Snapdragon X Elite and X Plus devices.
- Intel Core Ultra 200S and 300H-series chips with Intel AI Boost.
- AMD Ryzen AI 300 and AI 400 series processors.
- Workstations with NVIDIA RTX 5000 Ada GPUs (via a supplemental CUDA agent).

Minimum RAM is 16 GB, though 32 GB is recommended for multi-window navigation. Codex itself installs as a native ARM64 and x64 app from the Microsoft Store, pulling down about 8 GB of model files. An internet connection is required for initial authentication and optional cloud model access, but the core loop of screen understanding and action execution works offline once models are cached.

Early users with Snapdragon X Elite laptops report smooth performance, averaging 200-400 ms of latency between a click approval and the on-screen action. “It feels faster than a remote desktop session,” said Sara Vogel, a full-stack developer who tested the beta. “The NPU does the heavy lifting for the vision part, so my CPU usage barely blipped.”

Developer Community Reacts

Reactions on X and GitHub have been a mix of excitement and healthy skepticism. The ability to offload repetitive setup tasks has drawn the most praise.

“I used it to bootstrap a React Native project with Expo, Fastlane, and Detox—something that usually takes me two days of fiddling with environment variables. Codex did it in 22 minutes while I reviewed each step,” posted @alexcodes_js on X.

Others worry about the “black box” effect. “If Codex sets up my environment, I won’t know exactly what it did. When something breaks later, I’ll waste even more time troubleshooting,” commented redditor u/sysadmindev.

Microsoft and OpenAI seem to anticipate that pushback. The audit log includes not only screenshots but also the exact commands Codex issued and the resulting stdout/stderr. A new “Explain” button in the supervision panel expands any action into a natural-language summary of its intent and effect.

Open source contributor and Windows enthusiast Rafael Rivera noted on his blog that Codex’s Computer Use API is documented and will be accessible to third-party developers later this year. “Imagine Copilot in Teams being able to schedule a meeting by actually clicking through Outlook. This is far bigger than just coding,” he wrote.

Competing with Claude and Devin

OpenAI’s move puts it head-to-head with Anthropic’s Claude Computer Use (launched in late 2025 for macOS) and Cognition’s Devin, which has been agentic from the start. Codex on Windows, however, benefits from deep OS-level hooks that neither competitor can match on Microsoft’s platform.

Claude’s Computer Use relies on generic screenshot analysis and virtualized mouse/keyboard emulation, which often breaks when UI layouts change. Devin runs primarily in the cloud, limiting its ability to interact with local hardware or offline tools. Codex’s hybrid architecture gives it an edge in latency and reliability on Windows, though it’s currently Windows-exclusive—leaving macOS and Linux users waiting.

Microsoft’s Satya Nadella hinted that this is just the beginning: “We’re building the operating system for the agentic era. Every Windows API is becoming AI-addressable. Codex is the first application to demonstrate that at scale.” Analysts see this as Microsoft’s move to make Windows the default development platform for AI-augmented coding, potentially luring developers away from macOS.

What’s Next for Codex on Windows

OpenAI’s roadmap, shared during the Build 2026 conference, shows several fast-follow features:

  • Enterprise policy controls: IT admins will be able to define allowed actions via Group Policy or Intune, blocking destructive commands like disk formatting.
  • Team runbooks: Teams can share and version-control Computer Use scripts, turning common setup routines into one-click automations.
  • Voice-driven coding: Integration with Windows Voice Access will let developers narrate tasks and approve/reject actions hands-free.
  • Third-party plugins: Tooling vendors like Docker, Postman, and Figma are already developing Computer Use plugins that expose precise action schemas for safer automation.

Ryan J. Slack, a Gartner analyst, predicts that by 2028, 40% of professional developers will use an agentic coding tool daily. “The productivity gains are undeniable. The challenge is building trust. The transparency tools OpenAI and Microsoft shipped with this release are the right first step.”

For Windows 11 users, the update is rolling out gradually via the Codex app’s auto-updater. Those who don’t see it yet can force the update by running winget upgrade OpenAI.Codex in PowerShell. A 14-day free trial is available for new subscribers; existing ChatGPT Plus, Team, and Enterprise seats include Computer Use at no extra cost.

As the line between coding assistant and autonomous agent blurs, one thing is clear: the desktop that developers sit in front of every day just got a whole lot smarter.