Anthropic Just Won the Benchmark War It Started

Weeks after a government-ordered shutdown over jailbreak concerns, Claude Fable 5 has returned to top independent AI benchmarks, with four Anthropic models occupying the top five Intelligence Index slots.

#1, score 60

Fable 5 Rank

>99%

Jailbreak Block Rate

19 days

Suspension Length

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

July 5, 2026

1 min read

Claude Fable 5's return to full global availability on July 1 has been followed by a clean sweep of independent AI benchmark rankings: the model now sits at rank 1 on the Intelligence Index with a score of 60, ahead of Anthropic's own Opus 4.8 at 56 and OpenAI's GPT-5.5 at 55, with Anthropic models occupying four of the top five spots overall.

The comeback is notable given how the suspension happened. The US government ordered Fable 5 and the more restricted Mythos 5 offline on June 12 after Amazon researchers found a jailbreak technique that could prompt the model into identifying software vulnerabilities it was supposed to refuse to discuss -- a genuine national-security concern, not a routine safety hiccup. Anthropic spent the 19-day suspension building an improved safety classifier specifically targeting that technique, which the company says now blocks it in over 99% of attempts, and that fix is what convinced the Commerce Department to lift the export controls on June 30.

Alongside Fable 5's restoration, Anthropic launched Claude Sonnet 5, described as its most agentic Sonnet model yet, with stronger reasoning, tool use, coding and autonomous task handling, rolled out across Claude Code, the Claude Platform and Claude Enterprise with introductory pricing and higher rate limits -- broadening Anthropic's product lineup at exactly the moment its flagship model is topping public leaderboards again.

The sequence -- shutdown, fix, benchmark-topping relaunch, new product tier -- is a genuinely unusual turnaround story: most companies forced into a 19-day government-ordered shutdown over a security vulnerability would expect to lose ground to competitors during the outage, not emerge with their flagship model ranked first.

What to watch: whether OpenAI or Google respond with their own releases aimed at reclaiming the top Intelligence Index spot in the coming weeks, and whether Anthropic's new safety classifier holds up against novel jailbreak attempts now that Fable 5 is back at global scale.

Anthropic Just Won the Benchmark War It Started

#1, score 60

Fable 5 Rank

>99%

Jailbreak Block Rate

19 days

Suspension Length

Trace Cohen

Early-stage VC & angel · Founder, New York Venture Partners

July 5, 2026

1 min read

Anthropic Just Won the Benchmark War It Started

Read Next

Anthropic, Not OpenAI, Is Setting AI's 2026 Agenda

Samsung's 2nm AI Chip Talks Now Include Meta Too

Mistral Is Building Its Own Anti-OpenAI Playbook

Anthropic Just Won the Benchmark War It Started

Read Next

Anthropic, Not OpenAI, Is Setting AI's 2026 Agenda

Samsung's 2nm AI Chip Talks Now Include Meta Too

Mistral Is Building Its Own Anti-OpenAI Playbook