Microsoft AI Red Team updates agentic AI failure taxonomy with seven new attack modes after a year of red teaming
By
Microsoft AI Red Team
Hot, fresh, and worth queueing round the block for.
Summary
The Microsoft AI Red Team has updated its Taxonomy of Failure Modes in Agentic AI Systems, originally published in April 2025, based on 12 months of real-world red teaming experience. The update introduces seven new failure modes—including supply chain compromise and goal hijacking—alongside practical mitigations. The v1.0 taxonomy had identified novel failure modes unique to agentic systems (agent compromise, injection, impersonation, flow manipulation) and amplified existing ones (memory poisoning, cross-domain prompt injection). A surge in real-world attacks against agentic AI systems has driven this revision, reshaping risk assessment approaches for teams building and deploying autonomous AI agents.
Key quotes
· 3 pulledThe v1.0 taxonomy was largely forward-looking, built on practitioner interviews, cross-company threat modeling, and our own early operational experience.
It identified novel failure modes unique to agentic systems (agent compromise, injection, impersonation, flow manipulation) alongside existing failure modes materially amplified in agentic contexts (memory poisoning, cross-domain prompt injection).
Based on 12 months of red teaming, this update introduces seven new failure modes, from supply chain compromise to goal hijacking, and the practical mitigations teams need now.
You might also wanna read
Research on AI Failure Modes: How Misalignment Scales with Model Intelligence and Task Complexity
This research paper examines how AI system failures scale with model intelligence and task complexity, exploring whether failures manifest a

Agentic AI Enterprise Scaling: Insights from 70+ Founders and Practitioners
This article explores the current state of agentic AI through insights from over 70 founders and practitioners, examining how AI startups ar
New Benchmark Reveals High Rates of Outcome-Driven Constraint Violations in Autonomous AI Agents
Researchers introduce a new benchmark for evaluating autonomous AI agents' safety, specifically focusing on outcome-driven constraint violat
The 8 Levels of Agentic Engineering: A Framework for Effective AI Coding Implementation
The article presents a framework of 8 progressive levels for effectively utilizing AI coding capabilities, arguing that while AI's coding ab
How to Integrate Existing AI Agents into Microsoft Teams Using TypeScript SDK
This article provides a technical guide for developers on how to integrate existing AI agents or bots into Microsoft Teams without rewriting
microsoft.github.io·1mo agoAgent Arena: Testing AI Agents Against Prompt Injection Attacks
Agent Arena is a testing platform that allows developers to evaluate their AI agents' vulnerability to prompt injection attacks. The tool pr
