Anthropic Launches Claude Sonnet 5 at a Steep Discount as It Races Toward a Blockbuster IPO

Anthropic released Claude Sonnet 5 on June 30, setting it as the new default model for Claude Free and Pro users and pricing it at a steep discount to its flagship Opus 4.8 -- $2 per million input tokens and $10 per million output tokens through August 31, roughly 60% cheaper than Opus during the introductory window. The launch lands as Anthropic barrels toward an IPO, having confidentially filed with the SEC on June 1, turning a routine model release into a test of whether cheap, capable agents can carry the company's valuation into public markets.

June 30, 2026

Launch Date

$2 / M tokens (thru Aug 31)

Intro Price (input)

$10 / M tokens (thru Aug 31)

Intro Price (output)

$3 input / $15 output per M

Post-Intro Price

$5 input / $25 output per M

Opus 4.8 Price

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 30, 2026

3 min read

Anthropic released Claude Sonnet 5 on June 30, 2026, immediately making it the default model for Claude Free and Pro users and pricing it well below its own flagship. Introductory API pricing runs $2 per million input tokens and $10 per million output tokens through August 31, rising to $3 and $15 after that -- still a roughly 40% discount to Opus 4.8's standard $5 input / $25 output pricing, and closer to 60% cheaper during the introductory window, according to VentureBeat and TechCrunch.

The release is not just a model refresh -- it lands as Anthropic prepares for the public markets. The company confidentially filed IPO paperwork with the SEC on June 1, and Sonnet 5 arrives explicitly as what VentureBeat called a test of 'whether the private market's staggering AI valuations can survive public scrutiny.' Anthropic is choosing to enter that scrutiny with a model built for cheap, high-volume agentic workloads rather than a marginal bump in frontier benchmark scores.

The timing sits inside a brutal pricing war across the frontier labs. The same week, Google shipped Nano Banana 2 Lite (Gemini 3.1 Flash-Lite) as a faster, cheaper image generator, and DeepSeek open-sourced DSpark, a framework claiming to speed LLM inference by up to 85%. Meituan open-sourced LongCat-2.0, a 1.6-trillion-parameter, near-frontier agentic coding model trained entirely on Chinese chips. Every major lab is racing to cut the cost of running agents at scale, because the bottleneck for enterprise AI adoption has shifted from raw capability to unit economics.

“Meituan open-sourced LongCat-2.0, a 1.6-trillion-parameter, near-frontier agentic coding model trained entirely on Chinese chips.”

The numbers matter in context. Anthropic's decision to discount a near-Opus model by roughly 60% during launch is aggressive even by 2026 standards, and it directly pressures OpenAI's Codex pricing and Google's Gemini tiers -- both of which have also been cutting prices to win developer mindshare for coding and agent workloads. Sonnet 5 becoming the default free-tier model, rather than staying gated behind Pro, is itself a volume play: more usage data, more developer habit formation, more surface area before the roadshow starts.

For founders and operators building on top of frontier models, the read is straightforward: inference costs for near-frontier capability are falling fast, which lowers the cost of building AI-native products but also compresses the margin advantage of being an early mover on any single model. For GPs and LPs, Anthropic's IPO timeline is now concrete enough to plan around, and the market's reception will be a referendum on whether AI-lab valuations built on private funding rounds hold up against public investors who will demand real unit economics, not narrative.

The bear case is that discounting a flagship-adjacent model ahead of an IPO could read as margin pressure rather than strength -- public investors may ask why a company burning enormous compute costs is also cutting prices right before it needs to show a path to profitability. Model commoditization is also accelerating: if Sonnet 5, Gemini 3.1 Flash-Lite and open-source models like LongCat-2.0 are converging on similar capability-per-dollar, the moat shifts even further toward distribution, tooling and enterprise trust rather than the model itself.

What to watch: how OpenAI and Google respond on pricing in the coming weeks, whether Anthropic's IPO filing goes public and at what valuation, and whether Sonnet 5's default-tier rollout meaningfully shifts usage share away from GPT and Gemini among developers building agents.

Anthropic Launches Claude Sonnet 5 at a Steep Discount as It Races Toward a Blockbuster IPO

June 30, 2026

Launch Date

$2 / M tokens (thru Aug 31)

Intro Price (input)

$10 / M tokens (thru Aug 31)

Intro Price (output)

$3 input / $15 output per M

Post-Intro Price

$5 input / $25 output per M

Opus 4.8 Price

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

June 30, 2026

3 min read

“Meituan open-sourced LongCat-2.0, a 1.6-trillion-parameter, near-frontier agentic coding model trained entirely on Chinese chips.”

Anthropic Launches Claude Sonnet 5 at a Steep Discount as It Races Toward a Blockbuster IPO

Read Next

OpenAI Previews GPT-5.6 Sol, Terra and Luna — Restricted to Trusted Partners at Washington's Request

Cerebras Runs OpenAI GPT-5.6 Sol at 750 Tokens per Second, Setting a New Frontier-Model Speed Record

Adobe Acquires Topaz Labs to Bring Best-in-Class AI Image Enhancement Native to Creative Cloud

Anthropic Launches Claude Sonnet 5 at a Steep Discount as It Races Toward a Blockbuster IPO

Read Next

OpenAI Previews GPT-5.6 Sol, Terra and Luna — Restricted to Trusted Partners at Washington's Request

Cerebras Runs OpenAI GPT-5.6 Sol at 750 Tokens per Second, Setting a New Frontier-Model Speed Record

Adobe Acquires Topaz Labs to Bring Best-in-Class AI Image Enhancement Native to Creative Cloud