English(EN) Nothing from Something: Can a Language Model Discover 0?

GPT-2模型在没有示例的情况下难以发现数学概念

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-15 20:54

一项新的研究论文探讨了语言模型（特别是GPT-2大小的模型）发现诸如零之类的数学概念的能力。研究发现，即使经过语言预训练，这些模型在数学发现的分布外泛化方面也存在困难。然而，当模型接受零的示例训练时，性能会显著提高，语言预训练将所需示例的数量减少了约50%。 AI

影响探讨了当前语言模型在抽象数学推理和发现方面的局限性。

排序理由该集群包含一篇详细介绍人工智能模型能力研究结果的学术论文。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Phoebe Zeng, Thomas L. Griffiths, Brenden M. Lake · 2026-06-17 04:00

Nothing from Something: Can a Language Model Discover 0?

arXiv:2606.17289v1 Announce Type: new Abstract: AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mat…
arXiv cs.CL TIER_1 English(EN) · Brenden M. Lake · 2026-06-15 20:54

Nothing from Something: Can a Language Model Discover 0?

AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mathematical discovery requires a strong form of ou…