ENTITY
PersistBench
PersistBench
PulseAugur coverage of PersistBench — every cluster mentioning PersistBench across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
2 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New benchmark reveals safety risks in LLM long-term memory
A new benchmark called PersistBench has been developed to evaluate the safety risks associated with long-term memory integration in large language models. The benchmark identifies two key risks: cross-domain leakage, wh…
-
Alibaba's Qwen3.7-Max leads Chinese LLMs, ranks fifth globally
Alibaba's Qwen3.7-Max has been ranked the top-performing Chinese large language model and fifth globally by Artificial Analysis, a third-party evaluation platform. This new flagship model achieved a score of 56.6, surpa…