Two papers, one from 2016 by Zhang et al. and another from 2019 by Nagarajan and Kolter, are discussed for their impact on deep learning theory. The 2016 paper demonstrated that standard neural networks could easily memorize random data, challenging existing theories of generalization based on hypothesis class complexity. Subsequent research attempted to develop data-dependent bounds, but the 2019 paper is presented as a further blow to these efforts, suggesting that uniform convergence may be insufficient to explain deep learning's success. AI
影响 Challenges existing theoretical frameworks for understanding deep learning generalization, potentially redirecting future research.
排序理由 The cluster discusses academic papers and their theoretical implications for deep learning.
- Bartlett, Foster, and Telgarsky
- Hardt, Recht, and Singer
- ImageNet
- Nagarajan and Kolter
- Neyshabur, Bhojanapalli, and Srebro
- Rademacher complexity
- SGD
- statistical learning theory
- Valiant
- VC dimension
- generalization bounds
AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →