PulseAugur
实时 08:12:09
实体 multi-hot cross-entropy

multi-hot cross-entropy

PulseAugur coverage of multi-hot cross-entropy — every cluster mentioning multi-hot cross-entropy across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
1
90 天内 1
发布 · 30天
0
90 天内 0
论文 · 30天
1
90 天内 1
层级分布 · 90 天
最近 · 第 1/1 页 · 共 1 条
  1. RESEARCH · CL_22184 ·

    New Token Superposition method slashes LLM pre-training time by 2.5x

    Researchers have developed a new pre-training method called Token-Superposition Training (TST) that aims to make large language model training more efficient. TST involves a two-phase process: an initial superposition p…