Anthropic's Claude Opus 4.8 outperforms GPT-5.5 and Gemini 3.1 Pro

By PulseAugur Editorial · [1 sources] · 2026-05-29 05:57

Anthropic has released Claude Opus 4.8, a new model that reportedly outperforms GPT-5.5 and Gemini 3.1 Pro on a variety of tasks, particularly those involving long context windows. The model achieved a notable score on the SWE-bench benchmark, indicating strong performance in code generation and understanding. AI

IMPACT Sets a new benchmark for long-context performance, potentially influencing future model development and application design.

RANK_REASON New model release from a frontier lab with performance benchmarks. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on Towards AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

Towards AI TIER_1 English(EN) · Chew Loong Nian - AI ENGINEER · 2026-05-29 05:57

I Tested Opus 4.8 vs GPT-5.5 vs Gemini 3.1 Pro on 20 Tasks — Opus Embarrassed Both on Long Context

<div class="medium-feed-item"><p class="medium-feed-snippet">Anthropic dropped Claude Opus 4.8 on May 28, 2026. The headline number that made me stop scrolling wasn’t the SWE-bench score. It was this…</p><p class="medium-feed-link"><a href="https://pub.towardsai.net…

COVERAGE [1]

I Tested Opus 4.8 vs GPT-5.5 vs Gemini 3.1 Pro on 20 Tasks — Opus Embarrassed Both on Long Context

RELATED ENTITIES

RELATED TOPICS