Brief · PulseAugur

RESEARCH · Ben's Bites English(EN) · 1mo · [4 sources]

Anthropic built a model too risky to release

Anthropic has developed a new AI model named Claude Mythos, which demonstrates significant advancements in benchmark performance, particularly in identifying software vulnerabilities. Due to its advanced capabilities in finding and exploiting security flaws, Anthropic has opted not to release Mythos publicly. Instead, the company is providing limited access to select organizations through "Project Glasswing" to aid in cybersecurity research and vulnerability discovery, alongside a substantial commitment to open-source security initiatives. AI

IMPACT Restricted release of advanced AI model highlights growing safety concerns and the potential for AI in cybersecurity, influencing future development and deployment strategies.

Anthropic
Meta
Claude Mythos
Firefox
Project Glasswing
Claude Opus
Claude Sonnet
OpenBSD
FFmpeg
Muse Spark
Terminal-Bench 2.0
Sonnet 4.6
Opus 4.6
SWE-bench Pro