English(EN) @ emollick Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI

GPT-5.2 在科学同行评审中展现专家级表现

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-21 23:09

一项近期评估表明，GPT-5.2 在科学同行评审中表现出专家级水平。在一项涉及 45 名科学家和 469 小时的研究中，AI 评审在 82 篇论文上被发现与顶尖人类评审员不相上下。然而，AI 仍存在不足，表明 AI 与人类协作的混合方法是同行评审的最佳选择。 AI

影响 AI 模型在科学同行评审等复杂任务中正变得与人类专家具有竞争力，预示着研究效率和质量的提升潜力。

排序理由该集群描述了一项评估 AI 模型在特定任务（同行评审）上表现的研究，属于研究范畴。[lever_c_demoted from research: ic=1 ai=1.0]

在 Mastodon — sigmoid.social 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Mastodon — sigmoid.social TIER_1 English(EN) · [email protected] · 2026-05-21 23:09

@ emollick Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI

@ emollick Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI reviewers are competitive even with the top-rated reviewers in Nature’s official peer review..." though not without wea…

报道来源 [1]

@ emollick Seems GPT-5.2 reaches expert level in peer review: 45 scientists took 469 hours evaluating human & AI reviews on 82 papers. "Surprisingly, current AI

相关实体

相关话题