PulseAugur
实时 23:16:55

Paper questions weight decay's role in deep learning stability

A new paper investigates the role of weight decay in deep learning training stability, challenging its common perception as a simple regularization technique. The research analyzes how weight decay affects parameter dynamics and loss sharpness at the "Edge of Stability," demonstrating that it effectively slows down progressive sharpening. The study also reveals an architecture-dependent phase transition, where weight decay dampens oscillations in CNNs but stabilizes sharpness below a theoretical boundary in MLPs, driven by the alignment of parameter vectors and sharpness gradients. AI

影响 Investigates fundamental mechanisms of training stability, potentially leading to more robust and efficient deep learning model development.

排序理由 This is a research paper published on arXiv detailing novel findings about a machine learning technique.

在 arXiv stat.ML 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

Paper questions weight decay's role in deep learning stability

报道来源 [2]

  1. arXiv stat.ML TIER_1 English(EN) · Marius Saether, Amir Kolic, Tomaso Poggio, Pierfrancesco Beneventano ·

    Does Weight Decay Enhance Training Stability?

    arXiv:2605.16622v1 Announce Type: cross Abstract: In modern deep learning, weight decay is often credited with "stabilizing" training dynamics, diverging from its classical role as a static regularization penalty. We investigate a fundamental question: *does weight decay stabiliz…

  2. arXiv stat.ML TIER_1 English(EN) · Pierfrancesco Beneventano ·

    Does Weight Decay Enhance Training Stability?

    In modern deep learning, weight decay is often credited with "stabilizing" training dynamics, diverging from its classical role as a static regularization penalty. We investigate a fundamental question: *does weight decay stabilize training dynamics, and if so, through which mech…