Anthropic has released Claude Opus 4.8, a new iteration of its large language model that demonstrates modest but tangible improvements over its predecessor. This latest version outperforms OpenAI's GPT-5.5 and Google's Gemini 3.1 Pro on a majority of benchmarks. Additionally, Claude Opus 4.8 exhibits enhanced self-correction capabilities, catching its own coding errors at four times the rate of previous versions. AI
IMPACT Sets new SOTA on coding benchmarks; pressures OpenAI to respond.
RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →