Anthropic has released Claude 3.5 Sonnet 4.6, an upgrade to their previous Sonnet 4.5 model. This new version boasts broad improvements across coding, computer use, and long-context reasoning, and includes a 1 million token context window in beta. Early evaluations suggest its performance is approaching that of Opus-class models, though it may use more tokens for certain tasks. Additionally, METR has conducted preliminary safety evaluations on both Claude 3.5 Sonnet and OpenAI's o1 model, finding no significant evidence of dangerous capabilities but noting limitations in their testing methods. AI
Summary written by gemini-2.5-flash-lite from 7 sources. How we write summaries →
IMPACT Anthropic's Claude 3.5 Sonnet 4.6 release, with its enhanced capabilities and large context window, may push competitors to accelerate their own model development and feature rollouts.
RANK_REASON This cluster details a new model release from Anthropic with significant capability upgrades and new features, alongside preliminary safety evaluations.