PulseAugur
EN
LIVE 07:48:48

Neural network training symmetries yield conserved quantities with MSE loss

Researchers have investigated whether inherent symmetries in training data can result in conserved quantities during the gradient-flow training of neural networks. Their findings indicate that for analytic and non-polynomial loss functions, data symmetries generally do not introduce additional integrals of motion. However, with mean squared error loss, specific data augmentation techniques can lead to the emergence of conserved quantities. The study introduces a framework using 'tensorizable networks' to model this phenomenon, encompassing architectures like linear, polynomial networks, and Lightning Attention. AI

IMPACT This research could lead to more stable and predictable neural network training by identifying conserved quantities, potentially improving model performance and understanding.

RANK_REASON The cluster contains an academic paper detailing novel research findings on neural network training dynamics.

Read on arXiv stat.ML →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv stat.ML TIER_1 English(EN) · Jakob Galley, Vahid Shahverdi, Axel Flinth ·

    Conservation Laws from Data Symmetry in Neural Networks

    arXiv:2606.10913v1 Announce Type: cross Abstract: We explore whether intrinsic symmetries of the training data lead to conserved quantities during gradient-flow training of neural networks. Under the assumption that the loss function is analytic and non-polynomial, we prove that …

  2. arXiv stat.ML TIER_1 English(EN) · Axel Flinth ·

    Conservation Laws from Data Symmetry in Neural Networks

    We explore whether intrinsic symmetries of the training data lead to conserved quantities during gradient-flow training of neural networks. Under the assumption that the loss function is analytic and non-polynomial, we prove that data symmetries generically do not induce any addi…