English(EN) Traditional statistical representations outperform generative AI in identifying expert peer reviewers

AI在同行评审和专家识别等细微任务方面表现不佳

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-18 17:59

两篇新研究论文探讨了当前AI模型在专业学术任务中的局限性。其中一项名为Sem-Detect的研究提出了一种通过分析语义内容而非仅仅文本特征来区分AI生成同行评审和人类撰写评审的方法。另一篇论文则表明，在识别科学领域的专家同行评审者方面，传统的统计方法（如TF-IDF）比GPT-4o mini等生成式AI模型更有效。 AI

影响当前AI模型在准确区分同行评审中的AI生成内容与人类工作以及识别专业专家方面存在局限性，表明传统方法在这些细微任务上仍然更胜一筹。

排序理由两篇在arXiv上发表的学术论文展示了AI在特定学术背景下的局限性研究成果。

在 arXiv cs.IR (Information Retrieval) 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Andr\'e V. Duarte, Brian Tufts, Aditya Oke, Fei Fang, Arlindo L. Oliveira, Lei Li · 2026-05-22 04:00

Sem-Detect：AI生成同行评审的语义级检测

arXiv:2605.21713v1 Announce Type: new Abstract: How can we distinguish whether a peer review was written by a human or generated by an AI model? We argue that, in this setting, authorship should not be attributed solely from the textual features of a review, but also from the ide…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Wolfgang Kerzendorf · 2026-05-18 17:59

传统统计方法在识别专家同行评审者方面优于生成式AI

The exponential growth of scientific submissions has strained the peer review system. Despite the rapidly expanding global pool of researchers, this unprecedented scale has rendered the previous approach of manual expert identification unfeasible. Therefore, institutions have nat…

报道来源 [2]

Sem-Detect：AI生成同行评审的语义级检测

传统统计方法在识别专家同行评审者方面优于生成式AI

相关实体

相关话题