Anthropic's Claude Mythos model has reportedly broken the METR graph, a key visualization of AI progress. The METR graph tracks AI capabilities over time, and Claude Mythos's performance appears to have exceeded the graph's plotted limits. If accurate, this suggests a significant leap in AI development, one that may be outpacing current forecasting models.
IMPACT Suggests current AI progress metrics may be outdated and that AI development may be advancing faster than existing forecasts assume.
RANK_REASON The cluster describes a new benchmark result for an AI model that challenges existing metrics for AI progress.
AI-generated summary · Google Gemini · from 1 source.