English(EN) Position: Adversarial ML for LLMs Is Not Making Any Progress

研究人员认为：对抗性机器学习在大型语言模型方面停滞不前

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-03 04:00

一份新的立场文件认为，针对大型语言模型（LLMs）的对抗性机器学习研究并未取得重大进展。作者认为，该领域现在正在解决定义不明确、更难解决且更难评估的问题。他们警告说，该领域再花十年时间工作也可能只会带来微小的有意义的进展。 AI

影响对当前用于大型语言模型的对抗性机器学习技术的有效性提出了质疑，可能导致研究重点转移。

排序理由该集群包含一篇讨论研究进展的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Javier Rando, Jie Zhang, Nicholas Carlini, Florian Tram\`er · 2026-06-03 04:00

Position: Adversarial ML for LLMs Is Not Making Any Progress

arXiv:2502.02260v2 Announce Type: replace Abstract: In the past decade, considerable research effort has been devoted to securing machine learning (ML) models that operate in adversarial settings. Yet, progress has been slow even for simple "toy" problems (e.g., robustness to sma…