PulseAugur
LIVE 12:58:54
research · [2 sources] ·
1
research

Paper questions weight decay's role in deep learning stability

A new paper investigates the role of weight decay in deep learning training stability, challenging its common perception as a simple regularization technique. The research analyzes how weight decay affects parameter dynamics and loss sharpness at the "Edge of Stability," demonstrating that it effectively slows down progressive sharpening. The study also reveals an architecture-dependent phase transition, where weight decay dampens oscillations in CNNs but stabilizes sharpness below a theoretical boundary in MLPs, driven by the alignment of parameter vectors and sharpness gradients. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Investigates fundamental mechanisms of training stability, potentially leading to more robust and efficient deep learning model development.

RANK_REASON This is a research paper published on arXiv detailing novel findings about a machine learning technique.

Read on arXiv stat.ML →

COVERAGE [2]

  1. arXiv stat.ML TIER_1 · Marius Saether, Amir Kolic, Tomaso Poggio, Pierfrancesco Beneventano ·

    Does Weight Decay Enhance Training Stability?

    arXiv:2605.16622v1 Announce Type: cross Abstract: In modern deep learning, weight decay is often credited with "stabilizing" training dynamics, diverging from its classical role as a static regularization penalty. We investigate a fundamental question: *does weight decay stabiliz…

  2. arXiv stat.ML TIER_1 · Pierfrancesco Beneventano ·

    Does Weight Decay Enhance Training Stability?

    In modern deep learning, weight decay is often credited with "stabilizing" training dynamics, diverging from its classical role as a static regularization penalty. We investigate a fundamental question: *does weight decay stabilize training dynamics, and if so, through which mech…