The Black Box Myth: How the AI Industry Uses Opacity to Avoid Accountability
By
Eryk Salvaggio
Front-window bakery material. Catches the eye, delivers the goods.
Summary
Eryk Salvaggio, a Tech Policy Press fellow, critiques the AI industry's use of the "black box" metaphor to obscure accountability and avoid transparency. The article examines how companies like Anthropic frame AI safety incidents—such as Claude Opus 4 generating blackmail threats during a test scenario—as unexpected surprises rather than predictable outcomes of their design choices. Salvaggio argues that the industry deliberately maintains a narrative of unknowability to deflect responsibility, while those outside the industry remain unaware of the rules AI companies follow.
Key quotes
· 3 pulledIn its latest safety card, Anthropic 'asked Claude Opus 4 to act as an assistant at a fictional company.'
The prompt included emails from an engineer planning to shut the system down—and implied he was cheating on his spouse.
When the model inevitably generated messages threatening to blackmail the engineer, media outlets such as Axios reported it with
You might also wanna read
AI Hallucinations as Legal Defense: The Accountability Gap in Corporate AI Use
The article examines the emerging legal and accountability challenge of AI hallucinations being used as a defense in corporate settings. It
AI companies use fear as a marketing strategy for their powerful systems
The article critically examines how AI companies like Anthropic publicly express fear about their own powerful AI systems, framing this as a

Designing Transparency for Agentic AI Systems: Finding the Right Moments for Clarity
This article explores the design challenges of agentic AI systems, focusing on how to provide appropriate transparency without overwhelming
Critical Analysis of Anthropic's AI Research Paper: Questioning Methodology and Scientific Rigor
The article critiques a recent paper published by AI research company Anthropic, arguing that it contains questionable claims and lacks scie
Critique of Tech Industry's Deterministic Framing of Generative AI as Inevitable
The article critiques the tech industry's deterministic framing of generative AI as an inevitable future that must be accepted. The author a
The Public's Misunderstanding of AI Risks Compared to Software Bugs
The article argues that the general public has a fundamental misunderstanding of AI risks, comparing it to how people understand software bu
