PulseAugur

StorySim

PulseAugur coverage of StorySim — every cluster mentioning StorySim across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_06679

    New framework StorySim tests LLMs for Theory of Mind capabilities

    Researchers have developed a new framework called StorySim to evaluate the theory of mind (ToM) and world modeling (WM) capabilities of large language models. This system generates novel stories to test how well LLMs ca…