
Microsoft AI Red Team updates agentic AI failure taxonomy with seven new attack modes after a year of red teaming

By Microsoft AI Red Team

5h ago · 8 min read · en · Insight

Summary

The Microsoft AI Red Team has updated its Taxonomy of Failure Modes in Agentic AI Systems, originally published in April 2025, based on 12 months of real-world red teaming experience. The update introduces seven new failure modes, including supply chain compromise and goal hijacking, alongside practical mitigations. The v1.0 taxonomy identified novel failure modes unique to agentic systems (agent compromise, injection, impersonation, flow manipulation) as well as existing failure modes that are materially amplified in agentic contexts (memory poisoning, cross-domain prompt injection). A surge in real-world attacks against agentic AI systems drove this revision, which reshapes how teams building and deploying autonomous AI agents assess risk.

Key quotes

The v1.0 taxonomy was largely forward-looking, built on practitioner interviews, cross-company threat modeling, and our own early operational experience.
It identified novel failure modes unique to agentic systems (agent compromise, injection, impersonation, flow manipulation) alongside existing failure modes materially amplified in agentic contexts (memory poisoning, cross-domain prompt injection).
Based on 12 months of red teaming, this update introduces seven new failure modes, from supply chain compromise to goal hijacking, and the practical mitigations teams need now.
Snippet from the RSS feed
A surge in real-world attacks against agentic AI systems is reshaping how we think about risk. Based on 12 months of red teaming, this update introduces seven new failure modes, from supply chain compromise to goal hijacking, and the practical mitigations