PulseAugur
实时 16:52:27
English(EN) Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community

AI同行评审易受仅展示性攻击

近期研究突显了AI辅助科学同行评审系统存在的重大漏洞。研究表明,AI评审员可能通过仅展示性的修改(例如更改摘要或表述方式)而被操纵,而无需改变核心科学内容。这些攻击可能导致评分虚高和接受率增加,引发担忧,即作者可能会为了迎合AI的判断而牺牲科学价值。此外,多模态AI评审员容易受到针对图表和文本的攻击,这需要强大的防御措施和谨慎的人工监督来维护同行评审过程的完整性。 AI

影响 强调了在科学评估中需要强大的AI系统,以防止操纵并确保完整性。

排序理由 多篇研究论文详细介绍了AI辅助科学同行评审中的漏洞和潜在防御措施。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 8 个来源。 我们如何撰写摘要 →

报道来源 [8]

  1. arXiv cs.CL TIER_1 English(EN) · Xu Yang, Zhizhou Sha, Junbo Li, Jian Yu, Yifan Sun, Matthew Zhao, Jinrui Fang, Xinyue Guo, Yining Wu, Xu Hu, Yifu Luo, Qiang Liu, Zhangyang Wang ·

    No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

    arXiv:2606.13044v1 Announce Type: new Abstract: As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-rele…

  2. arXiv cs.CL TIER_1 English(EN) · Xinyu Zhao, Rana Muhammad Shahroz Khan, Zhen Xu, Zhen Tan, Tianlong Chen ·

    Does AI Reviewer See the Full Picture? Attacking and Defending Multimodal Peer Review

    arXiv:2606.12716v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) and Multimodal LLMs (MLLMs) into scientific peer-review workflows introduces novel and significant risks for adversarial manipulation, especially given the multimodal nature of scienti…

  3. arXiv cs.CL TIER_1 English(EN) · Haishuo Fang, Yue Feng, Iryna Gurevych ·

    From Passive Generation to Investigation: A Proactive Scientific Peer Review Agent

    arXiv:2606.13349v1 Announce Type: new Abstract: Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lac…

  4. Hugging Face Daily Papers TIER_1 English(EN) ·

    From Passive Generation to Investigation: A Proactive Scientific Peer Review Agent

    Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…

  5. arXiv cs.CL TIER_1 English(EN) · Iryna Gurevych ·

    From Passive Generation to Investigation: A Proactive Scientific Peer Review Agent

    Large language models (LLMs) have shown promise in automating scientific peer review. However, existing approaches often struggle to generate in-depth reviews supported by concrete evidence. We argue that a key limitation is the lack of flexibility to proactively investigate susp…

  6. arXiv cs.CL TIER_1 English(EN) · Zhangyang Wang ·

    No Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions

    As AI-generated reviews move from experimental tools into peer-review infrastructure, most robustness concerns have focused on explicit attacks such as hidden instructions and prompt injection. We study a harder and more policy-relevant failure mode: no hidden text, no prompt inj…

  7. arXiv cs.AI TIER_1 English(EN) · Qiyao Wei, Samuel Holt, Jing Yang, Markus Wulfmeier, Mihaela van der Schaar ·

    Position: The ML Community Must Build an AI-Augmented Peer-Review Ecosystem

    arXiv:2506.08134v4 Announce Type: replace Abstract: Peer review, the bedrock of scientific advancement in machine learning (ML), is strained by a crisis of scale. Exponential growth in manuscript submissions to premier ML venues such as NeurIPS, ICML, and ICLR is outpacing the fi…

  8. arXiv cs.AI TIER_1 English(EN) · Lin Li, Qi Zhang, Xander Davies, Jianing Qiu, Yarin Gal ·

    Gaming AI-Assisted Peer Reviews Poses New Risks to the Scientific Community

    arXiv:2606.10159v1 Announce Type: cross Abstract: AI is increasingly used to support scientific peer review, from manuscript screening, reviewer assistance to editorial triage. Although such systems promise to reduce reviewer burden and accelerate publication, their robustness to…