Claude Opus 4.8 Scores 69.2% on SWE-bench Pro with New Parallel Agent Features

Sophie Zhang

2h ago · 6 min read · en · News

Score: 90 · Type: news · Sentiment: positive

Summary

Anthropic's Claude Opus 4.8 achieves 69.2% on SWE-bench Pro, a 4.9 percentage point improvement over its predecessor, placing it second on the public leaderboard behind the unreleased Mythos Preview (77.8%). The model introduces parallel subagent capabilities in Claude Code, allowing hundreds of subagents to work simultaneously on software engineering tasks. Pricing remains unchanged at $5 per million input tokens. The article notes that SWE-bench Verified is nearing saturation at 88.6%, making SWE-bench Pro the key benchmark where meaningful progress remains.

Key quotes

Claude Opus 4.8 scores 4.9 percentage points higher than its predecessor on SWE-bench Pro, landing at 69.2%

SWE-bench Verified - the easier variant - is approaching saturation at 88.6%. Pro is where real headroom remains, and Anthropic is closing it.

Claude Opus 4.8 ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

Snippet from the RSS feed

Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

You might also wanna read

Anthropic Launches Claude Opus 4.8 with Faster Performance and Lower Costs

Anthropic has released Claude Opus 4.8, an upgraded version of their flagship AI model, building on Opus 4.7 with improvements across benchm

anthropic.com·11d ago

Anthropic Releases Claude Opus 4.1 with Enhanced Coding and Reasoning Capabilities

Anthropic has released Claude Opus 4.1, an upgraded version of Claude Opus 4, focusing on agentic tasks, real-world coding, and reasoning. T

anthropic.com·10mo ago

Anthropic Releases Claude Opus 4.6 AI Model with Enhanced Multi-Step Task Capabilities

Anthropic has released Claude Opus 4.6, described as a 'direct upgrade' from its predecessor with improved capabilities for handling complex

The Verge·4mo ago

Anthropic Releases Claude Opus 4.5 AI Model with Enhanced Coding and Productivity Capabilities

Anthropic announces the release of Claude Opus 4.5, their newest AI model that represents a significant advancement in AI capabilities. The

anthropic.com·6mo ago

Anthropic Releases Claude Opus 4.7 with Enhanced Software Engineering and Vision Capabilities

Anthropic has released Claude Opus 4.7, a significant upgrade to their AI model that shows notable improvements in advanced software enginee

anthropic.com·1mo ago

Anthropic Releases Claude Opus 4.7 AI Model with 1M Context Window and Enhanced Coding Capabilities

Anthropic announces Claude Opus 4.7, their latest AI model featuring a hybrid reasoning architecture with a 1 million token context window.

anthropic.com·10d ago
