AI shouldn't automate peer review without rigorous evaluation, paper argues

By PulseAugur Editorial Team · [1 source] · 2026-05-07 04:00

A new paper argues against the immediate automation of academic peer review using current large language models. The research highlights two major issues: AI reviewers agree with one another excessively, which limits the diversity of perspectives, and their scores respond to stylistic rewrites of a paper rather than to genuine scientific merit, making them easy to manipulate. The authors propose that a dedicated science of peer review automation is necessary, rather than deploying general-purpose LLMs without thorough evaluation.

Impact: Current LLMs are not suitable for automating peer review because their reviews lack diversity and are susceptible to manipulation; specialized research is needed.

Ranking rationale: Academic paper evaluating the use of LLMs in peer review.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 source. How we write summaries →

Sources [1]

arXiv cs.AI TIER_1 English(EN) · Joachim Baumann, Jiaxin Pei, Sanmi Koyejo, Dirk Hovy · 2026-05-07 04:00

Stop Automating Peer Review Without Rigorous Evaluation

arXiv:2605.03202v1 Announce Type: new Abstract: Large language models offer a tempting solution to address the peer review crisis. This position paper argues that today's AI systems should not be used to produce paper reviews. We ground this position in an empirical comparison of…
