ENTITY
Vishal Misra
Vishal Misra
PulseAugur coverage of Vishal Misra — every cluster mentioning Vishal Misra across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
Bayesian wind tunnels reveal transformer geometric design for inference
Researchers have developed "Bayesian wind tunnels" to rigorously study how transformers perform Bayesian reasoning. These controlled environments allow for the verification of Bayesian posteriors with high accuracy in s…
-
New paper details how cross-entropy training shapes transformer attention
Researchers have analyzed how cross-entropy training shapes attention scores and value vectors within transformer attention heads. Their work introduces an advantage-based routing law for attention scores and a responsi…