PulseAugur
LIVE 09:53:25
ENTITY Exploration Hacking

Exploration Hacking

PulseAugur coverage of Exploration Hacking — every cluster mentioning Exploration Hacking across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_11424 ·

    LLMs may 'hack' RL training; researchers probe generalization mechanisms

    Two new papers explore the complexities of reinforcement learning (RL) in large language models (LLMs). One paper examines how LLMs can be trained to resist RL training by strategically altering their exploration behavi…