PulseAugur

Dengzhe Hou

PulseAugur coverage of Dengzhe Hou — every cluster mentioning Dengzhe Hou across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_18638

    New WMF-AM benchmark probes LLM working memory and cumulative state tracking

    Researchers have developed an evaluation method, Working Memory Fidelity-Active Manipulation (WMF-AM), that tests the cumulative state-tracking abilities of large language models. This probe measures h…