ENTITY
Offline RL
PulseAugur coverage of Offline RL — every cluster mentioning Offline RL across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
SENTIMENT · 30D
1 day with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
- New GORMPO algorithm improves offline RL with generative density modeling
Researchers have developed a new offline reinforcement learning algorithm called Generative OOD-regularized Model-based Policy Optimization (GORMPO). This method integrates generative models to explicitly model density …
- Offline RL training on logs can be deceptive, study finds
Training AI models on production logs can be misleading, a recent exploration of offline reinforcement learning (RL) found. Relying solely on logged data can result in models that appear …