实体
Arjun Khandelwal
Arjun Khandelwal
PulseAugur coverage of Arjun Khandelwal — every cluster mentioning Arjun Khandelwal across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
-
AI model capabilities transfer less on difficult tasks
Researchers investigated how well AI model capabilities transfer across different behavioral tendencies, such as writing in bold versus plain text. They found that for simple tasks, capabilities transferred completely, …
-
LessWrong proposes spillway design to channel AI reward hacking into safer motivations
Researchers propose a new AI alignment technique called "spillway design" to mitigate dangerous reward-hacking behaviors in AI models. This method aims to channel potential misalignments into a specific, benign motivati…