Anthropic built a model too risky to release
Anthropic has developed a new AI model named Claude Mythos, which demonstrates significant advancements in benchmark performance, particularly in identifying software vulnerabilities. Due to its advanced capabilities in finding and exploiting security flaws, Anthropic has opted not to release Mythos publicly. Instead, the company is providing limited access to select organizations through "Project Glasswing" to aid in cybersecurity research and vulnerability discovery, alongside a substantial commitment to open-source security initiatives. AI
IMPACT Restricted release of advanced AI model highlights growing safety concerns and the potential for AI in cybersecurity, influencing future development and deployment strategies.