Deutsch(DE) # ExploitBench : Forscher der Carnegie Mellon University messen erstmals stufenweise, wie weit ein # AI -Modell eine # Sicherheitslücke wirklich ausnutzen kann.

AI models tested for exploit capabilities; Anthropic's Mythos shows advanced execution

By PulseAugur Editorial · [1 sources] · 2026-06-18 16:25

Researchers at Carnegie Mellon University have developed ExploitBench, a new framework to measure how effectively AI models can exploit security vulnerabilities. While most public frontier models cause crashes, they generally fail to breach sandbox environments. Anthropic's private Mythos Preview model, however, demonstrated complete code execution on 18 out of 41 vulnerabilities, indicating a concerning advancement in AI's cybersecurity exploitation capabilities. AI

IMPACT AI models are demonstrating advanced capabilities in exploiting security vulnerabilities, posing new challenges for cybersecurity defenses.

RANK_REASON The cluster describes a new research framework and its findings on AI model capabilities in exploiting security vulnerabilities. [lever_c_demoted from research: ic=1 ai=1.0]

Read on Mastodon — sigmoid.social →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

AI models tested for exploit capabilities; Anthropic's Mythos shows advanced execution

COVERAGE [1]

Mastodon — sigmoid.social TIER_1 Deutsch(DE) · [email protected] · 2026-06-18 16:25

# ExploitBench: Researchers from Carnegie Mellon University measure for the first time, step-by-step, how far an #AI model can exploit a #security vulnerability.

# ExploitBench : Forscher der Carnegie Mellon University messen erstmals stufenweise, wie weit ein # AI -Modell eine # Sicherheitslücke wirklich ausnutzen kann. Öffentliche Frontier-Modelle lösen Abstürze aus, scheitern aber daran, die V8-Sandbox zu durchbrechen. Einzige Ausnahme…

LINKS arxiv.org/…/2605.14153

COVERAGE [1]

# ExploitBench: Researchers from Carnegie Mellon University measure for the first time, step-by-step, how far an #AI model can exploit a #security vulnerability.

RELATED ENTITIES

RELATED TOPICS