This paper theoretically investigates how data geometry influences generalization in overparameterized neural networks trained below the edge of stability. It derives generalization bounds for two-layer ReLU networks that adapt to the intrinsic dimension of the data distribution, showing that distributions which are harder to shatter with ReLU activation thresholds generalize better, while data concentrated on a sphere favors memorization.
Summary written by gemini-2.5-flash-lite from 1 source.
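As a rough schematic of what "adapting to intrinsic dimension" means here (an assumed generic form, not the paper's stated theorem), such bounds typically replace the ambient dimension with the intrinsic dimension d of the data support:

\[ \mathbb{E}[L(\hat{f})] - \hat{L}_n(\hat{f}) \lesssim \sqrt{\frac{C(d)}{n}}, \]

where \hat{f} is the trained network, L and \hat{L}_n are the population and empirical risks, n is the sample size, and C(d) is a complexity term that grows with the intrinsic dimension d rather than the ambient dimension, so data concentrated near a low-dimensional set yields a tighter generalization gap.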
IMPACT Provides theoretical insights into neural network generalization, potentially guiding future model architectures and training strategies.
RANK_REASON This is a theoretical research paper published on arXiv concerning neural network generalization.