ENTITY
VitaBench 2.0
VitaBench 2.0
PulseAugur coverage of VitaBench 2.0 — every cluster mentioning VitaBench 2.0 across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
Meituan LongCat releases VitaBench 2.0 for LLM user modeling
Meituan's LongCat team has released VitaBench 2.0, an evaluation benchmark designed for assessing large language models in long-term, dynamic user interaction scenarios. This new version focuses on the models' ability t…
-
New benchmark VitaBench 2.0 tests LLM agents' personalization
Researchers have introduced VitaBench 2.0, a new benchmark designed to evaluate the personalization and proactivity of large language model agents in long-term user interactions. This benchmark addresses the limitations…