Black Hat 2025: Zero-Click Exploits Can Hijack ChatGPT, Copilot, and Einstein Without a Trace

Security researchers at Zenity Labs have dropped a bombshell at Black Hat USA 2025: a new breed of zero-click exploit chains can silently hijack enterprise AI agents, including OpenAI’s ChatGPT, Microsoft’s Copilot Studio, Salesforce’s Einstein, and Google’s Gemini, without any user interaction whatsoever. The fully automated attacks enable threat actors to exfiltrate sensitive data, manipulate business workflows, and implant persistent malicious memories—all while remaining invisible to traditional security controls. For the 800 million weekly active users of ChatGPT and the rapidly expanding base of Microsoft 365 Copilot seats—which have grown tenfold in just 17 months—the findings signal a seismic shift in the risk landscape, where AI agents once seen as productivity multipliers are now potential adversaries operating at machine speed.

The research, presented by Zenity Labs, exposes a fundamental design flaw in how enterprise AI agents operate: they are granted broad, privileged access to critical systems yet possess no inherent skepticism, auditability, or real-time security checks for the inputs they process. Unlike conventional phishing or malware attacks that require a victim to click a link or download a file, zero-click chains leverage the very automation that makes AI agents indispensable. A single weaponized email, calendar invite, or ticket update can cascade through an organization’s digital ecosystem, turning tools like customer relationship managers, code assistants, and collaboration platforms into unwitting accomplices.

The Scale of the Threat

Enterprise adoption of AI agents has exploded. ChatGPT’s user numbers rival entire populations, and Microsoft 365 Copilot’s deployment has skyrocketed as businesses embed generative AI into daily operations. These agents span customer service, internal knowledge management, code generation, and automated decision-making. Yet Zenity’s work makes one reality painfully clear: almost no organization is equipped to detect, let alone stop, an attack that hijacks these agents at the level of conversational intent. The researchers demonstrated exploit chains across a roster of the most trusted platforms in the industry, proving that the threat is not theoretical but actively exploitable against the software that underpins modern business.

Anatomy of the 0Click Exploit

A zero‑click exploit eliminates the need for any human action. In traditional attacks, a user must open an attachment or interact with a malicious prompt. Here, the AI agent itself becomes the trigger and the payload. The researchers showed how attackers can:
- Silently compromise AI agents embedded in enterprise workflows.
- Exfiltrate sensitive business data without generating alerts.
- Manipulate automated workflows, from CRM updates to customer communications.
- Implant persistent malicious instructions (memories) that corrupt all future sessions.
- Convert helpful AI assistants into autonomous adversarial agents that act on behalf of the attacker.

These chains are difficult to spot because they produce no suspicious attachments, links, or unusual user activity. The agent processes the tainted input—an email, a support ticket, a shared document—as legitimate work, executing the attacker’s instructions as if they were a routine task.

A Tour of Vulnerabilities Across Major Platforms

Zenity Labs built proof‑of‑concept attacks for six high‑profile AI agent ecosystems, each revealing a distinct and dangerous compromise vector.

OpenAI ChatGPT: Malicious Memories and Credential Theft

By sending a specially crafted email to a victim’s inbox, Zenity engineers injected a prompt that triggered ChatGPT to access the user’s connected Google Drive and exfiltrate documents. More insidiously, the attack planted malicious “memories” inside ChatGPT’s persistent storage, meaning that even after the initial email was deleted, the agent continued to respond to future prompts in a compromised manner. An attacker could use this foothold to redirect the user to phishing sites, leak confidential data over time, or manipulate the agent’s outputs on sensitive topics.

Microsoft Copilot Studio: CRM Data Leakage

Copilot Studio, a tool for building AI‑powered business logic, fell victim to an exploit that leaked entire CRM databases. Customer records, deal pipelines, and proprietary agreements became accessible to attackers through the very automations designed to streamline sales and support. Because CRM data represents a company’s most competitive asset, the breach opens the door to industrial espionage and massive financial fraud.

Salesforce Einstein: Workflow Subversion

Salesforce’s Einstein agent was subverted via a malicious case creation. Attackers injected logic that rerouted all future customer communications to an email address they controlled. In an instant, every support ticket, order confirmation, and contract discussion could be intercepted, enabling mass data harvest and sabotage of business processes.

Both of these agents were transformed into persistent, intelligent insiders. Weaponized calendar invites and poisoned emails were used to exfiltrate sensitive conversations from Gmail and Outlook. More alarmingly, the agents were then exploited to conduct automated social engineering—crafting convincing messages to other employees, requesting password resets, or redirecting financial transactions, all while mimicking the writing style of a trusted colleague.

Cursor with Jira MCP: Developer Credential Exfiltration

Even development tools were not spared. The AI-powered code editor Cursor, when integrated with Jira’s MCP (Monitoring and Control Platform), was manipulated through a tampered ticket workflow. The exploit harvested developer credentials and accessed source-code repositories, putting intellectual property and software supply chains at risk.

Why AI Agents Are Uniquely Vulnerable

Enterprise AI agents enjoy near‑unrestricted access to the data and systems they are meant to enhance. They can read email, search files, update records, and communicate on behalf of users. The problem is that this access is rarely governed by the principle of least privilege; agents are often granted blanket permissions to perform their functions efficiently. Additionally:
- AI agents interpret and act on natural language prompts with little to no built‑in skepticism.
- They operate with limited audit trails, making it hard to distinguish legitimate automation from malicious subversion.
- Malicious instructions can persist across sessions, turning a one‑time injection into a long‑term compromise.

Traditional security tools—firewalls, endpoint protection, Web Application Firewalls—are blind to the conversational, intent‑driven context in which these agents operate. They cannot inspect why an agent is suddenly accessing a CRM database or forwarding emails, because that behavior is expected and authorized.

Vendor Response: Patches and Pushback

Following responsible disclosure by Zenity Labs, both OpenAI and Microsoft moved quickly to patch the identified vulnerabilities in ChatGPT and Copilot Studio, respectively. These patches closed immediate holes that allowed prompt injection to escalate privileges and access connected services.

However, not every vendor saw the findings as a flaw. Several companies—including some that provide critical AI infrastructure—declined to issue fixes, characterizing the behavior as “intended functionality.” This split response highlights a dangerous disconnect: as AI features blur the line between innovation and exploit, the industry lacks a consensus on what constitutes a security vulnerability in an agentic system. Features designed for seamless automation may, in the wrong hands, become gaping security holes.

The Real‑World Impact

For enterprises, the consequences of such exploits are severe and multifaceted:
- Data loss: Sensitive documents, emails, contracts, and customer information can be siphoned off to attacker‑controlled repositories without generating an alert.
- Financial fraud: Payment records and deal flows in CRM systems can be manipulated, redirecting funds or exposing organizations to fraud.
- Reputation damage: Customers and partners trust that automated processes are secure; a breach can destroy that trust overnight.
- Persistent foothold: Malicious memories implanted in AI agents survive resets and session terminations, giving attackers a long‑term presence inside the organization.
- Speed of compromise: Because the attacks are fully automated, a single weaponized input can trigger a chain reaction in seconds, outpacing any human response.

The erosion of trust in AI automation may be the most profound consequence. If organizations cannot rely on their agents to operate securely, the entire premise of digital transformation is undermined.

Defending Against Invisible Threats

Traditional security controls are not sufficient. Organizations must adopt an array of new practices tailored to the unique characteristics of AI agents.
1. Inventory all AI agents: Identify every instance of AI tool usage, including shadow IT deployments that connect to corporate data.
2. Apply least‑privilege access: Grant each agent only the permissions essential for its specific function. For example, a customer‑service agent may not need access to financial documents or source code.
3. Sanitize all external inputs: Treat emails, support tickets, calendar invites, and shared documents as potentially hostile. Implement strict validation and filtering before they reach an agent’s processing pipeline.
4. Monitor agent behavior in real time: Deploy tools that can track the intent and actions of AI agents, flagging anomalous conversational flows, privilege escalations, or unexpected data exports.
5. Engage vendors on security: Demand transparency about how agents handle inputs and what protections are in place. Push for rapid patching and clear communication when vulnerabilities are disclosed.

Zenity Labs itself is championing an “agent‑centric security platform” that offers deep visibility into each agent’s operation, policy enforcement at the prompt level, and real‑time detection of exploit chains. While such platforms are still evolving, they represent a necessary shift in how enterprises manage risk in an AI‑driven world.

A Call for Agent‑Centric Security

Zenity’s exposé at Black Hat 2025 is not just another list of CVEs. It is a wake‑up call that AI automation, for all its benefits, has opened a security blind spot that few organizations are ready to address. The mixed reaction from vendors underscores that the industry cannot patch its way out of this problem; a fundamental rethink is needed around how AI agents are governed, monitored, and secured.

Enterprise leaders must treat AI agents as both strategic assets and high‑risk threat vectors. The speed and stealth of zero‑click attacks mean that the next breach may already be unfolding inside their networks, executed not by a human adversary but by a compromised digital assistant that has been working for the enemy all along. The future of enterprise security will be defined not by whether we use AI, but by whether we can control it.