新理论揭示了神经网络特征学习中的突变

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 11:39

研究人员开发了一个新的理论框架，以理解神经网络如何学习特征，特别是在大宽度网络中。他们的工作揭示，随着可用数据的增多，特征学习会经历一系列尖锐的、不连续的转变。这种理解带来了精确的“神经网络缩放定律”，该定律根据可学习特征的有效数量和数据预算来确定贝叶斯最优泛化误差。 AI

影响为理解和潜在地改进神经网络的学习方式提供了理论基础，影响了未来的模型开发。

排序理由学术论文，详细介绍了关于神经网络特征学习的理论发现。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv stat.ML TIER_1 English(EN) · Jean Barbier · 2026-05-11 11:39

Sharp feature-learning transitions and Bayes-optimal neural scaling laws in extensive-width networks

We study the information-theoretic limits of learning a one-hidden-layer teacher network with hierarchical features from noisy queries, in the context of knowledge transfer to a smaller student model. We work in the high-dimensional regime where the teacher width $k$ scales linea…