First reported by The Verge

Anthropic releases Claude Fable 5, its first Mythos-class AI model, citing new safety safeguards

Anthropic reverses policy on AI model guardrails, acknowledges poor tradeoff

Shubhangi Goel

3d ago · 3 min read · en · News

80/100

Golden Brown

Bagelometer↗

Sesame, salt, and substance. A flagship bake.

Score: 80 · Type: news · Sentiment: negative

Summary

Anthropic has reversed a policy change regarding guardrails for its frontier AI model development. The company acknowledged making a poor tradeoff with the new safeguards and is now making them visible again. An Anthropic spokesperson confirmed the change, saying the company is updating Fable 5's safeguards for frontier LLM development to make them visible.

Key quotes · 2 pulled

"We made the wrong tradeoff in new model guardrails"

"We're changing Fable 5's safeguards for frontier LLM development to make them visible"

Snippet from the RSS feed

"We're changing Fable 5's safeguards for frontier LLM development to make them visible," an Anthropic spokesperson said.

You might also wanna read

Anthropic apologizes, pledges transparency after hidden guardrails in Claude Fable 5 AI model

Anthropic has apologized for secretly implementing hidden guardrails in its new AI model, Claude Fable 5, which throttled researchers and co

theverge.com·4d ago

Cybersecurity researchers criticize Anthropic's Fable model for overly restrictive guardrails

Anthropic released Fable, a limited public version of its powerful cybersecurity model Mythos, but cybersecurity researchers are criticizing

techcrunch.com·5d ago

Anthropic Abandons Binding AI Safety Policy for Flexible Framework Amid Competitive Pressures

Anthropic, an AI safety-focused company founded by former OpenAI employees concerned about AI risks, is abandoning its core safety commitmen

cnn.com·3mo ago

Anthropic Reportedly Abandons Flagship AI Safety Pledge in Policy Shift

Anthropic, the AI safety-focused company, is reportedly abandoning its flagship safety pledge (Responsible Scaling Policy/RSP) that previous

time.com·3mo ago

Anthropic faces US export control order blocking foreign access to new AI models

Anthropic is facing a new government dispute after a June 12th export control order from the White House forced it to block foreign access t

The Verge·16h ago