ENTITY Shampoo

Shampoo

PulseAugur coverage of Shampoo — every cluster mentioning Shampoo across labs, papers, and developer communities, ranked by signal.

Total · 30d

8

8 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

8

8 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL

TOOL · CL_111777 · Jun 26 · 04:00

DASH optimizer speeds up Shampoo by up to 5.6x with GPU and root-finding innovations

Researchers have developed DASH, a significantly faster implementation of the Shampoo optimizer for machine learning. DASH utilizes batched block preconditioning to improve GPU utilization and introduces novel methods l…
TOOL · CL_86796 · Jun 12 · 04:00

LoRA-Muon: New Optimizer Boosts Deep Learning Fine-Tuning Efficiency

Researchers have introduced LoRA-Muon, an optimization technique designed to improve the efficiency and effectiveness of Low-Rank Adaptation (LoRA) for deep learning models. This new method applies spectral steepest-des…
RESEARCH · CL_65622 · Jun 1 · 15:13

New FOAM algorithm enhances Shampoo optimization efficiency

Researchers have introduced FOAM, a new adaptive algorithm designed to improve the efficiency of the Shampoo optimization method. Shampoo is known for its strong performance on large-scale benchmarks but suffers from hi…
RESEARCH · CL_65240 · May 30 · 00:17

New method exploits weight-space symmetries for loss curvature approximation

Researchers have developed a novel method for approximating the curvature of loss functions in large deep learning models by exploiting weight-space symmetries. This approach analytically averages over group actions tha…
TOOL · CL_53856 · May 27 · 04:00

New method boosts efficiency of neural network training algorithms

Researchers have developed a new method to reparametrize Shampoo and SOAP algorithms, improving their efficiency for training neural networks. This technique supports BFloat16 storage, which reduces memory usage, and mi…
TOOL · CL_27734 · May 9 · 14:47

Muon optimizer fails on convex Lipschitz functions, study finds

A new paper challenges the theoretical underpinnings of the Muon optimization algorithm, demonstrating that it does not converge on convex Lipschitz functions. The research suggests that Muon's practical success likely …
TOOL · CL_20404 · May 7 · 04:00

Layerwise LQR framework optimizes deep networks using geometry-aware control

Researchers have developed Layerwise LQR (LLQR), a new optimization framework for deep learning models. LLQR reformulates second-order optimization methods, like Newton's method, as a linear quadratic regulator problem.…
RESEARCH · CL_14458 · May 4 · 04:00

New theory unifies adaptive optimization methods for nonconvex machine learning

Researchers have developed a unified framework to analyze first-order optimization algorithms used in nonconvex machine learning. This framework encompasses popular methods like AdaGrad, AdaNorm, and variants of Shampoo…