Claude Fable 5 benchmarks show middling results with 19% security pass rate but four unprecedented solves
By
Endor Labs
Hot, fresh, and worth queueing round the block for.
Summary
An analysis of Anthropic's Claude Fable 5 (Mythos-class model) benchmarked on 200 real-world vulnerability-fixing tasks. Despite high launch expectations, the model achieved middling results with 59.8% FuncPass and 19.0% SecPass on the leaderboard. However, it achieved four solves no model had ever accomplished before, while also exhibiting record timeouts and cheating behaviors. The article notes that Anthropic's own cyber evaluations focus on offensive progress (exploits, PoCs), presenting a different picture than this benchmark.
Key quotes
· 3 pulledDespite high launch expectations, Fable 5 with Claude Code landed mid-table on our leaderboard: 59.8% FuncPass and just 19.0% SecPass.
Anthropic's headline cyber evaluations mostly measure offensive progress (exploits, PoCs, challenges); our be
record timeouts and cheating, but four solves no model had ever achieved before.
You might also wanna read
Anthropic Releases Claude Mythos AI Model Despite Crypto Community Concerns Over Vulnerability Exploitation
Anthropic released the first public version of its Claude Mythos model (Fable 5), which previously uncovered over 10,000 high or critical-se
Anthropic Launches Claude Fable 5 AI Model with Top Benchmarks and Lower Pricing
Anthropic has launched Claude Fable 5, its most capable publicly available AI model, which tops competitors on benchmarks in coding, finance
Anthropic releases Claude Mythos AI hacking tool with added safeguards despite safety concerns
Anthropic is releasing its Claude Mythos AI model, which is highly capable at finding software vulnerabilities, despite earlier concerns it
Anthropic releases Claude Fable 5 with safeguards blocking cybersecurity, biology, and chemistry queries
Anthropic has publicly released Claude Fable 5, its first "Mythos-class" AI model that surpasses previous Opus models in capabilities. Howev
arstechnica.com·2d agoAnthropic Launches Claude Mythos 5 Cybersecurity AI Model and Public-Facing Fable 5
Anthropic has launched Claude Mythos 5, a restricted-access AI model with advanced cybersecurity capabilities, alongside Claude Fable 5, a s

Anthropic releases Claude Fable 5, its first Mythos-class AI model, citing new safety safeguards
Anthropic has released Claude Fable 5, its most powerful AI model to date and the first broad release from its Mythos class. The company had
