PulseAugur / Brief
EN
LIVE 13:24:55

Brief

last 24h
[1/1] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

  1. 📰 GPT-5.5 batte Claude Fable 5 nel benchmark Agents Last Exam Un nuovo benchmark chiamato Agents Last Exam (ALE), creato dalla Berkeley RDI con oltre 300 espert

    OpenAI's GPT-5.5 has outperformed Anthropic's Claude Fable 5 on a new AI benchmark called Agents Last Exam (ALE). This benchmark, developed by Berkeley RDI with input from over 300 experts, tests autonomous AI agents. The result is surprising, as Claude Fable 5 was previously considered the leading model for such tasks. AI

    📰 GPT-5.5 batte Claude Fable 5 nel benchmark Agents Last Exam Un nuovo benchmark chiamato Agents Last Exam (ALE), creato dalla Berkeley RDI con oltre 300 espert

    IMPACT Sets a new performance standard for AI agents, potentially shifting the competitive landscape and influencing future development priorities.