SGD Provably Learns Spurious Features First in Neural Networks

By PulseAugur Editorial · [2 sources] · 2026-06-29 15:17

A new theoretical analysis of two-layer ReLU neural networks trained with SGD reveals that the optimization process prioritizes learning spurious correlations over genuine signal features. The study demonstrates that SGD can learn these spurious features exponentially fast, and their presence can actively inhibit the learning of the true signal. The research identifies specific phase transitions in the learning dynamics, showing how the alignment of features and weight signs accelerates spurious learning, while large margins can suppress signal learning. AI

IMPACT Highlights a fundamental challenge in AI training, suggesting that current optimization methods may inherently favor shortcuts, impacting model reliability and generalization.

RANK_REASON Academic paper detailing theoretical analysis of neural network training dynamics. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv stat.ML →

paper
safety

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

SGD Provably Learns Spurious Features First in Neural Networks

COVERAGE [2]

arXiv stat.ML TIER_1 English(EN) · Tyler LaBonte, Vidya Muthukumar · 2026-06-30 04:00

SGD Provably Prioritizes a Shortcut Spurious Feature in the XOR Model

arXiv:2606.30444v1 Announce Type: new Abstract: Neural networks are known to be susceptible to over-reliance on spurious correlations. However, the precise mechanism by which models exploit shortcut features is not fully understood, and algorithms to mitigate this behavior rely o…
arXiv stat.ML TIER_1 English(EN) · Vidya Muthukumar · 2026-06-29 15:17

SGD Provably Prioritizes a Shortcut Spurious Feature in the XOR Model

Neural networks are known to be susceptible to over-reliance on spurious correlations. However, the precise mechanism by which models exploit shortcut features is not fully understood, and algorithms to mitigate this behavior rely on as yet unjustified assumptions about the learn…

COVERAGE [2]

SGD Provably Prioritizes a Shortcut Spurious Feature in the XOR Model

SGD Provably Prioritizes a Shortcut Spurious Feature in the XOR Model

RELATED ENTITIES

RELATED TOPICS