PulseAugur
EN
LIVE 16:23:44
ENTITY AmBench

AmBench

PulseAugur coverage of AmBench — every cluster mentioning AmBench across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
TIMELINE
  1. 2026-04-28 research_milestone Researchers introduce AmBench, a benchmark demonstrating LLMs' struggles with recognizing human names, impacting privacy. source
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_30939 ·

    LLMs fail to reliably recognize names, impacting privacy tools

    A new benchmark, AmBench, reveals that large language models struggle to reliably recognize human names, a critical component for privacy protection tools. Researchers found that LLMs mishandle ambiguous names, leading …