Two new arXiv papers explore the spectral dynamics of deep neural networks during training. One paper introduces "Neural Low-Degree Filtering" (Neural LoFi) as a theoretical framework to understand hierarchical feature learning as an iterative spectral procedure. The other paper uses a dynamical mean-field theory to analyze how hidden-weight spectra evolve, predicting outlier behavior and hyperparameter transfer in wide networks. AI
影响 These theoretical frameworks offer new perspectives on how deep neural networks learn, potentially guiding future model development and analysis.
排序理由 Two academic papers published on arXiv presenting theoretical frameworks for understanding deep learning.
- Dynamical Mean-Field Theory (DMFT)
- GPT
- Gradient Descent
- ImageNet
- Dynamical Mean-Field Theory
- Neural Low-Degree Filtering
AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →