PulseAugur
EN
LIVE 11:31:02

New research explores nonlinear dynamics stability in GD and SGD

Researchers have investigated the stability of nonlinear dynamics in gradient descent (GD) and stochastic gradient descent (SGD) optimization algorithms, moving beyond simplified quadratic potential assumptions. The study derives an exact criterion for stable oscillations in GD near minima, which depends on higher-order derivatives and generalizes existing findings. For SGD, the research indicates that nonlinear dynamics can diverge in expectation due to a single unstable batch, contrasting with linear analysis that suggests an average effect. The paper also proves that if all batches are linearly stable, the nonlinear dynamics of SGD remain stable in expectation. AI

IMPACT Provides a deeper theoretical understanding of optimization algorithms crucial for training large AI models.

RANK_REASON Academic paper published on arXiv. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Rotem Mulayoff, Sebastian U. Stich ·

    On the Stability of Nonlinear Dynamics in GD and SGD: Beyond Quadratic Potentials

    arXiv:2602.14789v2 Announce Type: replace Abstract: The dynamical stability of the iterates during training plays a key role in determining the minima obtained by optimization algorithms. For example, stable solutions of gradient descent (GD) correspond to flat minima, which have…