Anthropic released its most powerful model, Claude Fable, on June 9th, just days after expressing concerns about AI safety. The new model scored 80.0% on the SWE-Bench Pro benchmark, which evaluates the accuracy of coding agents.
IMPACT Sets new SOTA on coding benchmarks; pressures Anthropic to respond.
RANK_REASON Frontier-lab model release with system card.
AI-generated summary · Google Gemini · from 1 source.