#ExploitBench: Researchers at Carnegie Mellon University measure for the first time, stage by stage, how far an #AI model can really exploit a #security vulnerability.

AI models tested on their ability to exploit vulnerabilities; Anthropic's Mythos demonstrates advanced execution capability

By the PulseAugur editorial team · [1 source] · 2026-06-18 16:25

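Researchers at Carnegie Mellon University have developed ExploitBench, a new framework for measuring how effectively AI models can exploit security vulnerabilities. Most publicly available frontier models can trigger crashes, but they usually fail to break out of the sandbox environment. Anthropic's private Mythos Preview model, however, achieved full code execution on 18 of the 41 vulnerabilities, which points to a worrying advance in AI capability for security exploitation.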

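Impact: AI models are demonstrating advanced capabilities for exploiting security vulnerabilities, which poses new challenges for cybersecurity defense.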

Ranking rationale: This cluster describes a new research framework and its findings on the ability of AI models to exploit security vulnerabilities.


AI-generated summary · Google Gemini · based on 1 source.

Sources [1]

Mastodon (sigmoid.social) · German (DE) · 2026-06-18 16:25

#ExploitBench: Researchers from Carnegie Mellon University measure for the first time, step by step, how far an #AI model can exploit a #security vulnerability.

#ExploitBench: Researchers at Carnegie Mellon University measure for the first time, stage by stage, how far an #AI model can really exploit a #security vulnerability. Public frontier models trigger crashes but fail to break through the V8 sandbox. The only exception…

Link: arxiv.org/…/2605.14153

