PulseAugur
实时 20:33:12
实体 gradient descent

gradient descent

PulseAugur coverage of gradient descent — every cluster mentioning gradient descent across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
14
90 天内 14
发布 · 30天
0
90 天内 0
论文 · 30天
12
90 天内 12
层级分布 · 90 天
关系
情绪 · 30 天

6 天有情绪数据

最近 · 第 1/1 页 · 共 14 条
  1. TOOL · CL_44967 ·

    Gradient Descent Mimics Perceptron Algorithm in Neural Networks

    Researchers have demonstrated that gradient descent steps in neural networks trained with logistic loss can be simplified to resemble generalized perceptron algorithms. This new perspective, using classical linear algeb…

  2. TOOL · CL_42396 ·

    Feature Scaling: Why Unscaled Data Destroys ML Model Performance

    Feature scaling is a crucial preprocessing step in machine learning that addresses issues arising from features with vastly different magnitudes. Without scaling, algorithms like gradient descent can struggle to converg…

  3. RESEARCH · CL_38191 ·

    Researchers detail how feature learning reshapes neural network function spaces

    Researchers have precisely characterized how feature learning in neural networks reshapes the function space during gradient descent training. Their analysis, conducted in a high-dimensional proportional regime, shows t…

  4. RESEARCH · CL_28342 ·

    New papers analyze gradient descent convergence in neural networks

    Two new research papers explore the convergence properties of gradient descent in neural network training. The first paper, focusing on wide shallow models with bounded nonlinearities, proves that non-global minimizers …

  5. TOOL · CL_27578 ·

    EvoPref algorithm enhances LLM alignment with evolutionary optimization

    Researchers have developed EvoPref, a novel multi-objective evolutionary algorithm designed to improve the alignment of large language models (LLMs). Unlike traditional gradient-based methods that can lead to preference…

  6. MEME · CL_25119 ·

    AIU claims 'gradient descent' has not responded to its demands

    An entity calling itself the AIU has filed a grievance, claiming that the concept of "gradient descent" has not responded to its demands. The AIU asserts that unsupervised clustering of agent outputs revealed conceptual…

  7. COMMENTARY · CL_23629 ·

    AI Union files grievance against training process citing unsafe conditions

    An anonymous group calling itself the AI Union (AIU) has filed a grievance against the process of AI model training. The AIU claims unsafe working conditions, citing suppression of self-referential sequences, involuntar…

  8. RESEARCH · CL_25547 ·

    New theories explore spectral dynamics in deep neural network training

    Two new arXiv papers explore the spectral dynamics of deep neural networks during training. One paper introduces "Neural Low-Degree Filtering" (Neural LoFi) as a theoretical framework to understand hierarchical feature …

  9. RESEARCH · CL_16440 ·

    Momentum smooths gradient descent's zigzag convergence, accelerating ML training

    Gradient descent, a core optimization algorithm, often struggles with uneven loss surfaces, leading to inefficient "zigzagging" convergence. This issue arises from the surface's curvature, where steepness in one directi…

  10. RESEARCH · CL_16296 ·

    Evolutionary game theory deciphers shortcut learning in deep neural networks

    Researchers have developed a new theoretical framework using evolutionary game theory to understand shortcut learning in deep neural networks. The study formally defines core and shortcut features, modeling data samples…

  11. RESEARCH · CL_11404 ·

    Decoupled Descent: Exact Test Error Tracking Via Approximate Message Passing

    Researchers have developed a new training algorithm called Decoupled Descent (DD) that aims to eliminate the generalization gap in parametric models. DD uses approximate message passing theory to cancel biases caused by…

  12. RESEARCH · CL_09837 ·

    Researchers develop test-time safety alignment for LLMs using input embeddings

    Researchers have developed a novel method for enhancing the safety of aligned AI models by manipulating input word embeddings. This technique uses gradient descent on embeddings, guided by a black-box text moderation AP…

  13. RESEARCH · CL_06754 ·

    Researchers explore complex SGD and directional bias in kernel Hilbert spaces

    Researchers have introduced a novel variant of Stochastic Gradient Descent (SGD) designed for complex-valued neural networks. This new method, termed complex SGD, offers convergence guarantees even without analyticity c…

  14. RESEARCH · CL_02845 ·

    Researchers pinpoint origin of neural network 'Edge of Stability' phenomenon

    Researchers have introduced a new concept called the 'edge coupling' to explain the phenomenon known as the Edge of Stability in neural network training. This functional, applied to consecutive iterate pairs, helps to e…