PulseAugur
实时 10:28:33

LLM 研究代理因策略可压缩性而表现出低过拟合

研究人员调查了机器学习(尤其是由大型语言模型 LLM 驱动的机器学习)为何尽管采用了自适应基准测试,却表现出惊人低的过拟合。他们对 LLM 驱动的研究代理的研究表明,成功的机器学习策略具有高度可压缩性。使用短提示和一比特反馈进行的输出和输入压缩实验表明,这些瓶颈对各种数据集的性能影响极小,支持了有效策略占据策略空间低复杂度区域的观点。 AI

影响 表明成功的机器学习策略的固有可压缩性可能解释了在基准驱动的机器学习中观察到的过拟合缺乏现象。

排序理由 该集群包含一篇详细介绍机器学习泛化研究结果的学术论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Martin Andres Bertran, Aaron Roth, Zhiwei Steven Wu ·

    What Fits (Into Few Tokens) Doesn't Overfit: Compression and Generalization in ML Research Agents

    arXiv:2606.11045v1 Announce Type: new Abstract: Reusing a held-out benchmark adaptively should, in principle, invite overfitting. Yet benchmark-driven machine learning (ML) has produced surprisingly little overfitting in practice. An attractive hypothesis is that successful ML st…

  2. arXiv cs.LG TIER_1 English(EN) · Zhiwei Steven Wu ·

    What Fits (Into Few Tokens) Doesn't Overfit: Compression and Generalization in ML Research Agents

    Reusing a held-out benchmark adaptively should, in principle, invite overfitting. Yet benchmark-driven machine learning (ML) has produced surprisingly little overfitting in practice. An attractive hypothesis is that successful ML strategies are highly compressible. We study this …