Lilian Weng's post dives deep into the math behind Neural Tangent Kernel

By PulseAugur Editorial · [1 sources] · 2022-09-08 17:00

Lilian Weng's blog post delves into the mathematical underpinnings of the Neural Tangent Kernel (NTK), a concept used to explain the training dynamics of neural networks. The post focuses on NTK's definition and proofs, particularly how infinitely wide neural networks converge to a global minimum during gradient descent. It reviews foundational mathematical concepts like vector-to-vector derivatives, ordinary differential equations, the Central Limit Theorem, and Taylor expansions, which are essential for understanding NTK. AI

RANK_REASON Blog post providing a deep dive into the mathematical theory behind the Neural Tangent Kernel, referencing core academic papers.

Read on Lil'Log (Lilian Weng) →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

Lil'Log (Lilian Weng) TIER_1 English(EN) · 2022-09-08 17:00

Some Math behind Neural Tangent Kernel

<p>Neural networks are <a href="https://lilianweng.github.io/posts/2019-03-14-overfit/">well known</a> to be over-parameterized and can often easily fit data with near-zero training loss with decent generalization performance on test dataset. Although all these parameters are ini…

COVERAGE [1]

Some Math behind Neural Tangent Kernel

RELATED TOPICS