Ψ-Bench

PulseAugur coverage of Ψ-Bench: every cluster mentioning Ψ-Bench across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
Sentiment · 30d: 1 day(s) with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_68070

    New benchmark Ψ-Bench tests LLMs' persuasive dialogue skills

    Researchers have introduced Ψ-Bench, a new benchmark designed to evaluate the persuasive capabilities of large language models (LLMs) in conversational settings. The benchmark focuses on persona-sensitive influenci…