StableGrad stabilizes deep neural network training without batch normalization

By PulseAugur Editorial · [1 sources] · 2026-05-19 13:49

Researchers have introduced StableGrad, a novel optimizer-level mechanism designed to control the scale of activations and gradients in deep neural networks. This method aims to prevent training instability without relying on traditional batch normalization, which can be problematic for applications like Physics-Informed Neural Networks (PINNs). StableGrad operates by adjusting weight-gradient imbalances after backpropagation but before the optimizer update, thereby preserving the network's forward pass and physical residual accuracy. Evaluations on deep PINNs and standard architectures like ResNet and EfficientNet demonstrated StableGrad's effectiveness in improving accuracy and stabilizing optimization, even when batch normalization is removed. AI

IMPACT Offers a new technique to stabilize deep neural network training, particularly beneficial for physics-informed models where standard normalization methods are unsuitable.

RANK_REASON The cluster contains a new academic paper detailing a novel method for training neural networks. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

paper
infra

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

StableGrad stabilizes deep neural network training without batch normalization

COVERAGE [1]

arXiv cs.AI TIER_1 English(EN) · Enrique S. Quintana-Ortí · 2026-05-19 13:49

StableGrad: Backward Scale Control without Batch Normalization

Training very deep neural networks requires controlling the propagation of magnitudes across depth. Without such control, activations and gradients may vanish, explode, or enter unstable regimes that make optimization fail. Modern architectures often mitigate this problem through…

COVERAGE [1]

StableGrad: Backward Scale Control without Batch Normalization

RELATED ENTITIES

RELATED TOPICS