ENTITY
Xavier initialization
PulseAugur coverage of Xavier initialization — every cluster mentioning Xavier initialization across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
SENTIMENT · 30D
1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- New framework analyzes gradient descent convergence in neural networks
Researchers have developed a new framework to analyze the convergence of gradient descent in neural networks, extending beyond the traditional neural tangent kernel (NTK) regime. This framework applies to a broad range …
- LLM Study Diary #3: PyTorch tensors, float types, and training infrastructure
This LLM study diary entry focuses on PyTorch fundamentals for training large language models. It details tensor basics, exploring various floating-point data types like FP32, BF16, and FP8 for efficiency and stability.…