PulseAugur
EN
LIVE 04:15:02

Neural network training conserves quantities from data symmetry

Researchers have investigated whether inherent symmetries within training data can result in conserved quantities during the gradient-flow training of neural networks. Their findings suggest that for analytic and non-polynomial loss functions, data symmetries generally do not introduce additional integrals of motion. However, when using mean squared error (MSE) loss, specific scenarios involving data augmentation can lead to the emergence of extra conserved quantities. The study introduces a framework using "tensorizable networks" to describe this phenomenon, encompassing architectures like linear, polynomial, and Lightning Attention networks. AI

IMPACT This research could lead to more stable and predictable neural network training by understanding how data symmetries influence conserved quantities.

RANK_REASON The cluster contains an academic paper detailing novel research findings on neural network training. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv stat.ML →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv stat.ML TIER_1 English(EN) · Axel Flinth ·

    Conservation Laws from Data Symmetry in Neural Networks

    We explore whether intrinsic symmetries of the training data lead to conserved quantities during gradient-flow training of neural networks. Under the assumption that the loss function is analytic and non-polynomial, we prove that data symmetries generically do not induce any addi…