PulseAugur
EN
LIVE 13:24:03

GPT-5.5 surpasses Claude Fable 5 on new AI agent benchmark

OpenAI's GPT-5.5 has outperformed Anthropic's Claude Fable 5 on a new AI benchmark called Agents Last Exam (ALE). This benchmark, developed by Berkeley RDI with input from over 300 experts, tests autonomous AI agents. The result is surprising, as Claude Fable 5 was previously considered the leading model for such tasks. AI

IMPACT Sets a new performance standard for AI agents, potentially shifting the competitive landscape and influencing future development priorities.

RANK_REASON New model version (GPT-5.5) release with benchmark performance data. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on Mastodon — mastodon.social →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

GPT-5.5 surpasses Claude Fable 5 on new AI agent benchmark

COVERAGE [1]

  1. Mastodon — mastodon.social TIER_1 English(EN) · AI_BEAR_NEWS ·

    📰 GPT-5.5 batte Claude Fable 5 nel benchmark Agents Last Exam Un nuovo benchmark chiamato Agents Last Exam (ALE), creato dalla Berkeley RDI con oltre 300 espert

    📰 GPT-5.5 batte Claude Fable 5 nel benchmark Agents Last Exam Un nuovo benchmark chiamato Agents Last Exam (ALE), creato dalla Berkeley RDI con oltre 300 esperti, ha messo a confronto i modelli IA più avanzati. GPT-5.5 ha superato Claude Fable 5, una notizia inattesa dato che Cla…