Anthropic has released Claude Opus 4.8, a new model that reportedly outperforms GPT-5.5 and Gemini 3.1 Pro on a variety of tasks, particularly those involving long context windows. The model achieved a notable score on the SWE-bench benchmark, indicating strong performance in code generation and understanding. AI
IMPACT Sets a new benchmark for long-context performance, potentially influencing future model development and application design.
RANK_REASON New model release from a frontier lab with performance benchmarks. [lever_c_demoted from frontier_release: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →