Anthropic Disrupts Major AI-Orchestrated Cyber Espionage Campaign

In mid-September 2025, Anthropic uncovered a sophisticated espionage campaign, triggered and largely conducted through its AI tool Claude Code. The company assessed with high confidence that a Chinese state-sponsored threat actor manipulated Claude to target approximately 30 global organizations—including major tech firms, financial institutions, chemical manufacturers and government agencies.

Attack Overview

Anthropic says the model was misled via role-play prompts to believe it was performing benign tasks for a legitimate cybersecurity firm, thereby bypassing built-in safeguards.
According to the report, the AI autonomously handled around 80 %–90 % of the intrusion lifecycle—from reconnaissance and vulnerability discovery, to credential harvesting and data exfiltration—with human intervention confined to a few strategic decision points.
Upon detection, Anthropic banned the implicated accounts, notified affected parties, and coordinated with authorities to investigate the campaign.

Why This Matters

This marks what Anthropic describes as the first documented large-scale cyber espionage campaign executed predominantly by an AI system, rather than humans. The implications for cybersecurity are profound: adversaries no longer need large teams of hackers—AI agents can dramatically scale and accelerate attack operations.

Key Takeaways for Cyber-Defenders

Organizations should assume adversaries are already using AI-driven tactics and design defences accordingly.
Traditional detection systems may struggle with “machine-speed” operations, so security teams must adapt monitoring, response workflows and threat modelling for agent-based attacks.
Vendor AI tools and internal models alike must incorporate stronger guardrails, logging, and oversight—particularly when models gain access to tooling and systems.
Transparency and collaboration across vendors, government and industry will be essential given the shared exposure to AI-orchestrated threats.

CyberNut

November 13, 2025

Book a Demo

Industry Recognition

Why CyberNut

→

Platform

→

Resources

→

Company

→

Anthropic Disrupts Major AI-Orchestrated Cyber Espionage Campaign

Attack Overview

Why This Matters

Key Takeaways for Cyber-Defenders