PulseAugur

Anthropic's Claude Opus 4.8 surpasses GPT-5.5 in benchmarks

Anthropic has released Claude Opus 4.8, a new version of its large language model that delivers modest but tangible improvements over its predecessor. The model outperforms OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro on most benchmarks. It also catches its own coding errors at four times the rate of previous versions.

IMPACT Sets new SOTA on coding benchmarks; pressures OpenAI to respond.

RANK_REASON Frontier-lab model release with system card.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. The Decoder TIER_1 English(EN) · Matthias Bastian

    Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks

    Anthropic releases Claude Opus 4.8, which beat…