Anthropic rolls out Claude Sonnet 5, its most agentic model yet

Anthropic has just rolled out Claude Sonnet 5, bringing more autonomous tool use and stronger reasoning at lower prices. The company says it nears Opus 4.8 on key benchmarks, and it’s now the default for Free and Pro users.

June 30, 2026

TL;DR

Claude Sonnet 5 launched: planning, tool use (browser and terminal), and autonomous runs; Anthropic calls it its “most agentic Sonnet yet”
Benchmark highlights: SWE-bench Pro 63.2%; Terminal-Bench 2.1 80.4%; OSWorld-Verified 81.2%
Near-Opus performance: Humanity’s Last Exam (tools) 57.4% vs Opus 4.8 57.9%; GDPval-AA v2 1618 vs 1615
Partner feedback: completes complex tasks and checks its own output without prompting; Anthropic cites an “attractive price point”
Availability: Default for Free/Pro; also for Max, Team, Enterprise across Claude apps and Claude Platform
Pricing: $2/M input, $10/M output through Aug 31, 2026; $3/$15 starting Sep 1, 2026

Anthropic on Tuesday introduced Claude Sonnet 5, describing it as its “most agentic Sonnet yet” and claiming it can make plans, use tools such as browsers and terminals, and run autonomously at a level that “just a few months ago required larger and more expensive models.”

The company also claimed Sonnet 5 is “a substantial improvement” over Sonnet 4.6 in reasoning, tool use, coding, and knowledge work, while landing “close to Opus 4.8” at lower prices. In a benchmark card shared alongside the launch, Sonnet 5 posted 63.2% on SWE-bench Pro, versus 58.1% for Sonnet 4.6 and 69.2% for Opus 4.8; 80.4% on Terminal-Bench 2.1, versus 67.0% and 82.7% for the same two models; and 81.2% on OSWorld-Verified, versus 78.5% and 83.4%. On Humanity’s Last Exam with tools, Sonnet 5 scored 57.4%, just behind Opus 4.8’s 57.9%, while on GDPval-AA v2 it reached 1618, slightly ahead of Opus 4.8’s 1615.

Anthropic says early access partners found that Sonnet 5 finishes complex tasks where earlier Sonnets stopped short, checks its own output without being asked, and does its agentic work at what the company calls an “attractive price point.” The model is now the default on Free and Pro, and is also available to Max, Team, and Enterprise users across Claude apps and the Claude Platform.

The pricing table attached to the announcement lists Sonnet 5 at $2 per million input tokens and $10 per million output tokens through August 31, 2026, rising to $3 and $15 on September 1, 2026. That matches the standard rates listed for Sonnet 4.6, while Opus 4.8 is listed at $5/$25. Responses on X quickly turned to comparisons with Opus, pricing questions, and requests for higher usage limits.
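To make the price change concrete, here is a minimal Python sketch of the arithmetic using the per-million-token rates quoted above. The function name `sonnet5_cost` and the example workload (10 million input tokens, 2 million output tokens) are hypothetical and chosen only for illustration. The sketch also assumes a simple per-token charge with no discounts, caching, or other adjustments that real billing may apply.

```python
from datetime import date

# Rates in USD per million tokens, as quoted in the announcement's pricing table.
PROMO_END = date(2026, 8, 31)   # last day of the introductory rate
PROMO_RATES = (2.00, 10.00)     # (input, output) through Aug 31, 2026
STANDARD_RATES = (3.00, 15.00)  # (input, output) from Sep 1, 2026


def sonnet5_cost(input_tokens: int, output_tokens: int, on: date) -> float:
    """Estimate the USD cost of a workload on a given date (hypothetical helper)."""
    in_rate, out_rate = PROMO_RATES if on <= PROMO_END else STANDARD_RATES
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


# Hypothetical workload: 10M input tokens and 2M output tokens.
before = sonnet5_cost(10_000_000, 2_000_000, date(2026, 8, 31))
after = sonnet5_cost(10_000_000, 2_000_000, date(2026, 9, 1))
print(f"Through Aug 31: ${before:.2f}; from Sep 1: ${after:.2f}")
```

Under these assumptions, the same workload costs $40 at the introductory rate and $60 at the standard rate, a 50% increase.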

Source: Claude (@claudeai)
