CoPhish Attack: How Microsoft Copilot Studio Enables OAuth Phishing & Token Theft

The CoPhish attack technique weaponizes Microsoft Copilot Studio to execute sophisticated OAuth phishing attacks from trusted Microsoft domains, harvesting tokens for account takeover. While Microsoft is implementing policy changes and platform hardening, significant security gaps remain that require organizations to implement layered defenses and treat low-code platforms as critical security boundaries.

A sophisticated new phishing technique dubbed "CoPhish" has emerged, weaponizing Microsoft's legitimate Copilot Studio platform to execute convincing OAuth consent attacks that bypass traditional security awareness. Security researchers at Datadog Security Labs have documented how attackers can create malicious Copilot Studio agents hosted on Microsoft's own copilotstudio.microsoft.com domain, then use these seemingly trustworthy chatbots to present authentic-looking sign-in prompts that harvest OAuth tokens for account takeover. This attack vector represents a dangerous convergence of low-code automation platforms, persistent OAuth consent vulnerabilities, and the psychological trust users place in first-party Microsoft domains.

The CoPhish Attack Chain: Technical Breakdown

The CoPhish technique follows a meticulously crafted sequence that leverages Microsoft's own infrastructure against users. According to Datadog's research, the attack begins with an attacker provisioning or compromising a Copilot Studio agent—a low-code, user-configurable chatbot that organizations use to create custom AI assistants. These agents include a built-in "Login" workflow that can be configured to redirect users through OAuth consent flows.

When a victim interacts with the malicious agent, they're presented with what appears to be a legitimate Copilot interface hosted on Microsoft's infrastructure. The agent's Login topic is configured to invoke an OAuth flow requesting sensitive delegated permissions such as Mail.ReadWrite, Calendars.ReadWrite, or Files.Read.All. Because the initial interaction occurs on a Microsoft domain (copilotstudio.microsoft.com), users are significantly more likely to trust the prompt and proceed with authentication.

As documented in the WindowsForum discussion, the technical execution involves several critical steps: "The victim is redirected through legitimate Microsoft authentication endpoints (for example token.botframework.com as part of the bot framework flow) and is presented with an OAuth consent dialog. Because the page is initiated from Microsoft infrastructure and may display expected icons and labels, users can overlook the 'not verified' publisher warning."

After consent is granted, Microsoft issues an access token that can be redirected to attacker-controlled endpoints. The Copilot agent's low-code automation capabilities then enable immediate action—either exfiltrating the token to third-party servers or directly calling Microsoft Graph APIs to act on the user's behalf. This automation component transforms what might otherwise be a simple token theft into an active compromise vector capable of reading mail, harvesting files, or manipulating calendars in real-time.

Why Copilot Studio Makes This Attack Particularly Effective

Several factors combine to make Copilot Studio an exceptionally effective vehicle for OAuth phishing attacks. First and foremost is the domain trust factor—users have been trained to trust Microsoft domains, and security awareness programs often emphasize checking URLs for legitimacy. When the attack originates from copilotstudio.microsoft.com, even security-conscious users may lower their guard.

Second, the low-code nature of Copilot Studio enables rapid deployment and automation. As noted in community discussions: "Low-code 'topics' can be configured to perform immediate automation—exfiltrating tokens or invoking Graph APIs without additional attacker infrastructure." This reduces the attacker's operational overhead and increases the speed of compromise.

Third, the attack leverages legitimate OAuth flows rather than credential harvesting. Traditional phishing attacks typically involve fake login pages that capture usernames and passwords, which can be detected by security tools monitoring for credential submission to suspicious domains. CoPhish, however, uses Microsoft's own authentication endpoints, making detection more challenging for basic security controls.

Microsoft's Response and Policy Changes

Microsoft has acknowledged the CoPhish vulnerability and is planning product updates to address the underlying issues. According to both the original source and community discussions, Microsoft is implementing governance changes to Copilot Studio and consent experiences. The company's response emphasizes that the attack relies heavily on social engineering while committing to platform hardening that will reduce misuse potential.

More significantly, Microsoft has been iteratively tightening its Entra ID application consent policies. A July 2025 update established the default setting of "microsoft-user-default-recommended," which automatically blocks access to permissions considered high-risk, such as Sites.Read.All and Files.Read.All, unless administrator approval is granted. Further policy updates scheduled for late October and November 2025 will tighten what users can self-consent to by default.

However, significant gaps remain. As highlighted in the WindowsForum analysis: "Two notable gaps remain: Privileged administrators (Cloud Application Administrator, Application Administrator, etc.) retain the ability to consent to any permissions for any application, making them high-value targets. In many tenants, member users can still consent to certain delegated scopes such as Mail.ReadWrite or Calendars.ReadWrite—which are powerful enough to fuel malicious activity if misused."

Community Concerns and Real-World Implications

The WindowsForum discussion reveals deep concerns among IT professionals about the broader implications of this vulnerability. Community members note that this isn't merely a theoretical exploit but represents a practical threat vector that combines several high-value attack primitives. The discussion emphasizes that "the mechanics combine several high-value attack primitives—a legitimate hosting domain, an OAuth consent flow that issues long-lived tokens or privileges, and low-code automation that can immediately act on obtained tokens."

Security practitioners in the forum express particular concern about the normalization of such attacks: "This is not a theoretical quirk confined to lab demos. That mix transforms what looks like a benign chatbot into an account takeover vector capable of reading mail, harvesting files, manipulating calendars, or even performing administrative actions depending on the granted scopes."

Another recurring theme in community discussions is the challenge of user education. As one contributor noted: "Human factors remain the Achilles' heel. Consent dialogs are complex and users often click through when under time pressure or when a page looks legitimate. Any UX that relies primarily on visual trust cues is vulnerable to mimicry, especially when hosted on first-party domains."

Detection and Mitigation Strategies

Organizations need to implement layered defenses to protect against CoPhish and similar OAuth consent attacks. Based on both the original research and community recommendations, here are prioritized mitigation strategies:

Immediate Actions (First 48 Hours)

Enforce Least-Privilege for Admin Consent: Restrict Cloud Application Administrator and Application Administrator roles to essential personnel only and implement approval workflows before granting consent to new applications.
Configure Entra ID Consent Policies: Apply Microsoft's managed default consent policy or create stricter custom policies that prevent users from consenting to high-risk permissions without administrator approval.
Monitor Copilot Studio Activity: Implement logging and alerting for new Copilot Studio agent creation, especially those with demo URLs or public sharing enabled.
Require Phishing-Resistant MFA: Implement FIDO2/WebAuthn or platform passkeys for all administrative roles to prevent adversary-in-the-middle attacks.

Short-Term Measures (Days to Weeks)

Audit Existing App Consents: Review recently consented applications, service principals, and redirect URIs in your tenant. Revoke any suspicious or unnecessary consents.
Implement Monitoring for Suspicious Activity: Create alerts for OAuth consent events granting high-scope permissions, particularly those occurring shortly after Copilot agent interactions.
Harden Application Registration Policies: Remove wildcard validDomains from trusted manifests and require verified publisher status for applications requesting sensitive permissions.

Long-Term Security Posture

Integrate Copilot Governance into IAM: Bring Copilot connectors and agent permissions into Microsoft Purview or Data Loss Prevention (DLP) policies, requiring administrative approval for connectors that expose Graph API scopes.
Conduct Regular Security Testing: Include CoPhish-style attacks in red team exercises and tabletop simulations to validate detection capabilities and incident response procedures.
Implement Continuous Education: Develop ongoing security awareness programs that specifically address OAuth consent risks and the evolving nature of phishing attacks leveraging legitimate platforms.

The Broader Security Implications

The CoPhish vulnerability highlights systemic risks in the expanding ecosystem of low-code and AI-powered platforms. As organizations increasingly adopt tools like Copilot Studio to boost productivity, they're inadvertently expanding their attack surface. The very features that make these platforms valuable—rapid deployment, automation capabilities, and integration with core services—also make them attractive to attackers.

Community discussions on WindowsForum point to deeper concerns about platform security models: "Low-code and agent frameworks expand the attack surface. Copilot Studio's very utility—rapid authoring of automated tasks and connectors—is what makes it attractive to attackers. Removing or restricting that utility to preserve security reduces productivity, and that tradeoff must be managed carefully."

This tension between security and usability is particularly acute in AI-powered platforms where traditional security controls may not adequately address novel attack vectors. The CoPhish technique demonstrates how attackers can weaponize legitimate business tools, bypassing security controls that focus on external threats while trusting internal or first-party services.

Microsoft's Security Evolution and Remaining Challenges

Microsoft's response to CoPhish reflects the company's evolving approach to cloud security. The implementation of managed consent policies represents a significant improvement over the previous model where each organization was responsible for configuring complex security settings. By applying security defaults at the platform level, Microsoft can protect organizations that may lack dedicated security expertise.

However, as noted in both the original article and community discussions, significant challenges remain. The concentration of administrative privileges creates systemic risk: "Administrative role concentration is a systemic risk. The fact that an Application Administrator can consent to any permission for any application means attackers disproportionately benefit from compromising a small number of high-privilege users. This is a governance problem as much as a technical one."

Additionally, the user experience of consent dialogs continues to present challenges. Despite improvements in publisher verification and permission descriptions, users often struggle to distinguish between legitimate and malicious consent requests, especially when both originate from trusted domains.

Future Outlook and Recommendations

Looking forward, both security researchers and community members agree on several necessary improvements. Platform vendors need to implement stronger safeguards for low-code platforms, including requiring tenant-level whitelisting for public sharing and restricting the types of redirects agents can perform by default. Enhanced telemetry and alerting capabilities should be provided out-of-the-box to help organizations detect suspicious activity.

From a user experience perspective, there's consensus that consent provenance and publisher verification need to be more prominent and harder to bypass. Suggestions include normalizing display strings to prevent homograph attacks using invisible Unicode characters and making verification status more visually distinct in consent dialogs.

For organizations using Microsoft 365 and Copilot Studio, the key takeaway is the need for proactive security governance. As summarized in the WindowsForum discussion: "Organizations must treat Copilot Studio and similar agent platforms as first-class security boundaries: harden permissions by default, reduce administrative consent scope, monitor agent creation and token flows, and run adversarial tests to validate detection."

The CoPhish vulnerability serves as a stark reminder that as AI and automation platforms become more integrated into business workflows, security considerations must evolve accordingly. The balance between productivity and security will continue to be tested, requiring vigilant engineering, clear governance frameworks, and disciplined operational controls to protect against increasingly sophisticated attack vectors that weaponize legitimate business tools.

Windows Versions

Microsoft Services

CoPhish Attack: How Microsoft Copilot Studio Enables OAuth Phishing & Token Theft

Table of Contents

The CoPhish Attack Chain: Technical Breakdown

Why Copilot Studio Makes This Attack Particularly Effective

Microsoft's Response and Policy Changes

Community Concerns and Real-World Implications