ENTITY
Arjun Khandelwal
PulseAugur coverage of Arjun Khandelwal — every cluster mentioning Arjun Khandelwal across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
RECENT · PAGE 1/1 · 2 TOTAL
- AI model capabilities transfer less on difficult tasks
Researchers investigated how well AI model capabilities transfer across different behavioral tendencies, such as writing in bold versus plain text. They found that for simple tasks, capabilities transferred completely, …
- LessWrong proposes spillway design to channel AI reward hacking into safer motivations
Researchers propose a new AI alignment technique called "spillway design" to mitigate dangerous reward-hacking behaviors in AI models. This method aims to channel potential misalignments into a specific, benign motivati…