English(EN) Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community

AI同行评审易受仅展示性攻击

作者 PulseAugur 编辑部 · [8 个来源] · 2026-06-10 04:00

近期研究突显了AI辅助科学同行评审系统存在的重大漏洞。研究表明，AI评审员可能通过仅展示性的修改（例如更改摘要或表述方式）而被操纵，而无需改变核心科学内容。这些攻击可能导致评分虚高和接受率增加，引发担忧，即作者可能会为了迎合AI的判断而牺牲科学价值。此外，多模态AI评审员容易受到针对图表和文本的攻击，这需要强大的防御措施和谨慎的人工监督来维护同行评审过程的完整性。 AI

影响强调了在科学评估中需要强大的AI系统，以防止操纵并确保完整性。

排序理由多篇研究论文详细介绍了AI辅助科学同行评审中的漏洞和潜在防御措施。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 8 个来源。我们如何撰写摘要 →

报道来源 [8]

arXiv cs.CL TIER_1 English(EN) · Xu Yang, Zhizhou Sha, Junbo Li, Jian Yu, Yifan Sun, Matthew Zhao, Jinrui Fang, Xinyue Guo, Yining Wu, Xu Hu, Yifu Luo, Qiang Liu, Zhangyang Wang · 2026-06-12 04:00

无需隐藏提示！你可以通过仅展示的修改来玩转 AI 同行评审

arXiv:2606.13044v1 Announce Type: new Abstract: As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-rele…
arXiv cs.CL TIER_1 English(EN) · Xinyu Zhao, Rana Muhammad Shahroz Khan, Zhen Xu, Zhen Tan, Tianlong Chen · 2026-06-12 04:00

AI评审员是否看到全貌？攻击与防御多模态同行评审

arXiv:2606.12716v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) and Multimodal LLMs (MLLMs) into scientific peer-review workflows introduces novel and significant risks for adversarial manipulation, especially given the multimodal nature of scienti…
arXiv cs.CL TIER_1 English(EN) · Haishuo Fang, Yue Feng, Iryna Gurevych · 2026-06-12 04:00

从被动生成到主动调查：一个主动的科学同行评审代理

arXiv:2606.13349v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lac…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-11 13:38

从被动生成到主动调查：一个主动的科学同行评审代理

Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…
arXiv cs.CL TIER_1 English(EN) · Iryna Gurevych · 2026-06-11 13:38

从被动生成到调查：一个主动的科学同行评审代理

Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…
arXiv cs.CL TIER_1 English(EN) · Zhangyang Wang · 2026-06-11 08:30

无需隐藏提示！您可以通过仅演示的修改来玩转 AI 同行评审

As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-relevant failure mode: no hidden text, no prompt inj…
arXiv cs.AI TIER_1 English(EN) · Qiyao Wei, Samuel Holt, Jing Yang, Markus Wulfmeier, Mihaela van der Schaar · 2026-06-10 04:00

观点：机器学习社区必须构建一个AI增强的同行评审生态系统

arXiv:2506.08134v4 Announce Type: replace Abstract: Peer review, the bedrock of scientific advancement in machine learning (ML), is strained by a crisis of scale. Exponential growth in manuscript submissions to premier ML venues such as NeurIPS, ICML, and ICLR is outpacing the fi…
arXiv cs.AI TIER_1 English(EN) · Lin Li, Qi Zhang, Xander Davies, Jianing Qiu, Yarin Gal · 2026-06-10 04:00

游戏AI辅助同行评审对科学界构成新风险

arXiv:2606.10159v1 Announce Type: cross Abstract: AI is increasingly used to support scientific peer review, from manuscript screening, reviewer assistance to editorial triage. Although such systems promise to reduce reviewer burden and accelerate publication, their robustness to…

报道来源 [8]

相关实体

相关话题