PulseAugur
实时 23:57:54
Polski(PL) Nowy benchmark badaczy z Carnegie Mellon University ujawnia drastyczną różnicę w zdolnościach modeli AI do autonomicznego łamania zabezpieczeń silnika V8, choć

AI models show varied V8 engine exploit abilities; Claude Mythos cost questioned

Researchers from Carnegie Mellon University have developed a new benchmark to test AI models' ability to autonomously exploit vulnerabilities in the V8 JavaScript engine. The benchmark revealed significant differences in the capabilities of various AI models. However, the high operational costs associated with Claude Mythos raise questions about its practical commercial viability. AI

影响 This benchmark highlights AI's growing capacity for complex security exploits, raising concerns about potential misuse and the cost-effectiveness of advanced AI systems.

排序理由 The cluster describes a new benchmark developed by university researchers to evaluate AI model capabilities. [lever_c_demoted from research: ic=1 ai=1.0]

在 Mastodon — mastodon.social 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

AI models show varied V8 engine exploit abilities; Claude Mythos cost questioned

报道来源 [1]

  1. Mastodon — mastodon.social TIER_1 Polski(PL) · aisight ·

    New benchmark from Carnegie Mellon University researchers reveals a drastic difference in AI models' abilities to autonomously break V8 engine security, although

    Nowy benchmark badaczy z Carnegie Mellon University ujawnia drastyczną różnicę w zdolnościach modeli AI do autonomicznego łamania zabezpieczeń silnika V8, choć koszty operacji Claude Mythos rzucają cień na jego komercyjną opłacalność. # si # ai # sztucznainteligencja # wiadomości…