PulseAugur
EN
LIVE 20:16:26

Anthropic's Claude Opus 4.8 outperforms GPT-5.5 and Gemini 3.1 Pro

Anthropic has released Claude Opus 4.8, a new model that reportedly outperforms GPT-5.5 and Gemini 3.1 Pro on a variety of tasks, particularly those involving long context windows. The model achieved a notable score on the SWE-bench benchmark, indicating strong performance in code generation and understanding. AI

IMPACT Sets a new benchmark for long-context performance, potentially influencing future model development and application design.

RANK_REASON New model release from a frontier lab with performance benchmarks. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on Towards AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. Towards AI TIER_1 English(EN) · Chew Loong Nian - AI ENGINEER ·

    I Tested Opus 4.8 vs GPT-5.5 vs Gemini 3.1 Pro on 20 Tasks — Opus Embarrassed Both on Long Context

    <div class="medium-feed-item"><p class="medium-feed-snippet">Anthropic dropped Claude Opus 4.8 on May 28, 2026. The headline number that made me stop scrolling wasn&#x2019;t the SWE-bench score. It was this&#x2026;</p><p class="medium-feed-link"><a href="https://pub.towardsai.net…