PulseAugur
LIVE 10:53:01
ENTITY multi-hot cross-entropy

multi-hot cross-entropy

PulseAugur coverage of multi-hot cross-entropy — every cluster mentioning multi-hot cross-entropy across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_22184 ·

    New Token Superposition method slashes LLM pre-training time by 2.5x

    Researchers have developed a new pre-training method called Token-Superposition Training (TST) that aims to make large language model training more efficient. TST involves a two-phase process: an initial superposition p…