ENTITY
ToolEmu
ToolEmu
PulseAugur coverage of ToolEmu — every cluster mentioning ToolEmu across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
LLM agents can improve safety by selectively quitting uncertain tasks · arXiv research
Researchers have developed a method for Large Language Model (LLM) agents to improve their safety by selectively quitting tasks they are uncertain about. This "quitting" mechanism, tested using the ToolEmu framework acr…
-
AI safety evaluations face 'safe-to-dangerous shift' challenge
A fundamental challenge in AI safety is the "safe-to-dangerous shift," which complicates realistic evaluations of AI models. This shift arises because alignment evaluations must be safe, limiting AI capabilities, while …