ENTITY
AbstentionBench
PulseAugur coverage of AbstentionBench — every cluster mentioning AbstentionBench across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
SENTIMENT · 30D
2 days with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- Conflicting studies emerge on LLM abstention and chain-of-thought
Two recent papers present conflicting findings on whether large language models can effectively abstain from answering and whether chain-of-thought prompting aids this capability. One study from COLING 2025 suggests that pro…
- LLMs fail at planning and admitting ignorance, new papers show
Two new papers evaluate the metacognitive abilities of large language models, specifically their capacity for planning and abstention. The TRIAGE paper found that most frontier and open-source LLMs perform poorly when t…