English(EN)Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community
AI同行评审易受仅展示性攻击
作者PulseAugur 编辑部·[8 个来源]·
近期研究突显了AI辅助科学同行评审系统存在的重大漏洞。研究表明,AI评审员可能通过仅展示性的修改(例如更改摘要或表述方式)而被操纵,而无需改变核心科学内容。这些攻击可能导致评分虚高和接受率增加,引发担忧,即作者可能会为了迎合AI的判断而牺牲科学价值。此外,多模态AI评审员容易受到针对图表和文本的攻击,这需要强大的防御措施和谨慎的人工监督来维护同行评审过程的完整性。
AI
arXiv:2606.13044v1 Announce Type: new Abstract: As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-rele…
arXiv cs.CL
TIER_1English(EN)·Xinyu Zhao, Rana Muhammad Shahroz Khan, Zhen Xu, Zhen Tan, Tianlong Chen·
arXiv:2606.12716v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) and Multimodal LLMs (MLLMs) into scientific peer-review workflows introduces novel and significant risks for adversarial manipulation, especially given the multimodal nature of scienti…
arXiv:2606.13349v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lac…
Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…
Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…
As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-relevant failure mode: no hidden text, no prompt inj…
arXiv cs.AI
TIER_1English(EN)·Qiyao Wei, Samuel Holt, Jing Yang, Markus Wulfmeier, Mihaela van der Schaar·
arXiv:2506.08134v4 Announce Type: replace Abstract: Peer review, the bedrock of scientific advancement in machine learning (ML), is strained by a crisis of scale. Exponential growth in manuscript submissions to premier ML venues such as NeurIPS, ICML, and ICLR is outpacing the fi…
arXiv:2606.10159v1 Announce Type: cross Abstract: AI is increasingly used to support scientific peer review, from manuscript screening, reviewer assistance to editorial triage. Although such systems promise to reduce reviewer burden and accelerate publication, their robustness to…