PulseAugur
LIVE 10:13:55
ENTITY Yuqian Fu

Yuqian Fu

PulseAugur coverage of Yuqian Fu — every cluster mentioning Yuqian Fu across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_06734 ·

    Researchers refine on-policy distillation for more stable LLM training

    Researchers have identified significant empirical failure modes in on-policy distillation (OPD), a technique used for post-training large language models. The standard implementation, which relies on sampled-token log-r…