English(EN) SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era

新的SciZoom基准测试追踪大型语言模型对科学写作的影响

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-24 04:00

研究人员推出了SciZoom，这是一个大规模基准测试，旨在评估分层科学摘要能力，并分析大型语言模型（LLMs）时代下的写作趋势。该基准测试包含来自2020年至2025年顶级机器学习会议的超过44,000篇论文，分为大型语言模型出现前和出现后两个时期。SciZoom提供三个摘要级别——摘要（Abstract）、贡献（Contributions）和简短总结（TL;DR）——并揭示了科学写作的显著变化，例如信心增强和散文同质化，公式化表达增加了10倍，而委婉语则下降了23%。 AI

影响为评估大型语言模型摘要能力和理解受人工智能写作工具影响的科学论述演变性质提供了新资源。

排序理由该集群描述了一个新的学术基准测试和数据集，用于评估大型语言模型在科学摘要方面的能力，并分析写作趋势。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Han Jang, Junhyeok Lee, Kyu Sung Choi · 2026-06-24 04:00

SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era

arXiv:2603.16131v2 Announce Type: replace Abstract: The explosive growth of AI research has created unprecedented information overload, increasing the demand for scientific summarization at multiple levels of granularity beyond traditional abstracts. While LLMs are increasingly a…

报道来源 [1]

SciZoom: A Large-scale Benchmark for Hierarchical Scientific Summarization across the LLM Era

相关实体

相关话题