PulseAugur
实时 08:42:27

New analysis reveals how step size impacts SGD alignment phenomenon

This paper analyzes the phenomenon of "suspicious alignment" in stochastic gradient descent (SGD) when dealing with ill-conditioned optimization problems. The study focuses on how step size selection influences the alignment of gradient updates with dominant subspaces. Researchers propose a step-size condition that differentiates between alignment-decreasing and alignment-increasing regimes, and demonstrate that under certain conditions, projecting SGD updates to the dominant space can paradoxically increase loss. AI

影响 Provides a theoretical understanding of SGD behavior, potentially informing the development of more robust optimization techniques for AI models.

排序理由 This is a research paper published on arXiv detailing a theoretical analysis of an optimization algorithm. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New analysis reveals how step size impacts SGD alignment phenomenon

报道来源 [1]

  1. arXiv cs.LG TIER_1 English(EN) · Shenyang Deng, Boyao Liao, Zhuoli Ouyang, Tianyu Pang, Minhak Song, Yaoqing Yang ·

    Suspicious Alignment of SGD: A Fine-Grained Step Size Condition Analysis

    arXiv:2601.11789v2 Announce Type: replace Abstract: This paper explores the suspicious alignment phenomenon in stochastic gradient descent (SGD) under ill-conditioned optimization, where the Hessian spectrum splits into dominant and bulk subspaces. This phenomenon describes the b…