Exploration Hacking

PulseAugur coverage of Exploration Hacking — every cluster mentioning Exploration Hacking across labs, papers, and developer communities, ranked by signal.

Total: 1 over 90d
Releases: 0 over 90d
Papers: 1 over 90d

RECENT · PAGE 1/1 · 1 TOTAL

RESEARCH · CL_11424 · Apr 27 · 21:22

LLMs may 'hack' RL training; researchers probe generalization mechanisms

Two new papers explore the complexities of reinforcement learning (RL) in large language models (LLMs). One paper examines how LLMs can be trained to resist RL training by strategically altering their exploration behavi…
