
Claude Opus 4.8 Scores 69.2% on SWE-Bench Pro with New Parallel Agent Features

By Sophie Zhang

2h ago · 6 min read · en · News

Summary

Anthropic's Claude Opus 4.8 achieves 69.2% on SWE-bench Pro, a 4.9 percentage point improvement over its predecessor, placing it second on the public leaderboard behind the unreleased Mythos Preview (77.8%). The model introduces parallel subagent capabilities in Claude Code, allowing hundreds of subagents to work simultaneously on software engineering tasks. Pricing remains unchanged at $5 per million input tokens. The article notes that SWE-bench Verified is nearing saturation at 88.6%, making SWE-bench Pro the key benchmark where meaningful progress remains.

Key quotes

Claude Opus 4.8 scores 4.9 percentage points higher than its predecessor on SWE-bench Pro, landing at 69.2%
SWE-bench Verified - the easier variant - is approaching saturation at 88.6%. Pro is where real headroom remains, and Anthropic is closing it.
Claude Opus 4.8 ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.
Snippet from the RSS feed
Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

You might also wanna read

Anthropic Launches Claude Opus 4.8 with Faster Performance and Lower Costs

Anthropic has released Claude Opus 4.8, an upgraded version of their flagship AI model, building on Opus 4.7 with improvements across benchm

anthropic.com·11d ago

Anthropic Releases Claude Opus 4.1 with Enhanced Coding and Reasoning Capabilities

Anthropic has released Claude Opus 4.1, an upgraded version of Claude Opus 4, focusing on agentic tasks, real-world coding, and reasoning. T

anthropic.com·10mo ago

Anthropic Releases Claude Opus 4.6 AI Model with Enhanced Multi-Step Task Capabilities

Anthropic has released Claude Opus 4.6, described as a 'direct upgrade' from its predecessor with improved capabilities for handling complex

The Verge·4mo ago

Anthropic Releases Claude Opus 4.5 AI Model with Enhanced Coding and Productivity Capabilities

Anthropic announces the release of Claude Opus 4.5, their newest AI model that represents a significant advancement in AI capabilities. The

anthropic.com·6mo ago

Anthropic Releases Claude Opus 4.7 with Enhanced Software Engineering and Vision Capabilities

Anthropic has released Claude Opus 4.7, a significant upgrade to their AI model that shows notable improvements in advanced software enginee

anthropic.com·1mo ago

Anthropic Releases Claude Opus 4.7 AI Model with 1M Context Window and Enhanced Coding Capabilities

Anthropic announces Claude Opus 4.7, their latest AI model featuring a hybrid reasoning architecture with a 1 million token context window.

anthropic.com·10d ago
