AI research links data geometry to neural network generalization

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-08 04:00

This paper theoretically investigates how data geometry influences generalization in overparameterized neural networks trained below the edge of stability. It derives generalization bounds for two-layer ReLU networks, showing that these bounds adapt to the intrinsic dimension of data distributions. The research indicates that data distributions that are harder to shatter with ReLU activation thresholds lead to better generalization, while data concentrated on a sphere favors memorization. AI

影响 Provides theoretical insights into neural network generalization, potentially guiding future model architectures and training strategies.

排序理由 This is a theoretical research paper published on arXiv concerning neural network generalization. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Tongtong Liang, Alexander Cloninger, Rahul Parhi, Yu-Xiang Wang · 2026-05-08 04:00

Generalization Below the Edge of Stability: The Role of Data Geometry

arXiv:2510.18120v3 Announce Type: replace-cross Abstract: Understanding generalization in overparameterized neural networks hinges on the interplay between the data geometry, neural architecture, and training dynamics. In this paper, we theoretically explore how data geometry con…

报道来源 [1]

Generalization Below the Edge of Stability: The Role of Data Geometry

相关实体

相关话题