ENTITY
Howard H Chen
Howard H Chen
PulseAugur coverage of Howard H Chen — every cluster mentioning Howard H Chen across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
Research suggests reinforcement learning reduces language model forgetting
A new research paper titled "Retaining by Doing" explores how to mitigate catastrophic forgetting in language models during post-training adaptation. The study compares supervised fine-tuning (SFT) with reinforcement le…
-
New REMIX method combats language model forgetting of facts
A new research paper introduces a method called REMIX (Random and Generic Data Mixing) to address the issue of language models forgetting previously learned information when updated with new data. The study, led by Howa…