ENTITY Offline Reinforcement Learning

Offline Reinforcement Learning

PulseAugur coverage of Offline Reinforcement Learning — every cluster mentioning Offline Reinforcement Learning across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

8 over 90d

Releases · 30d

0 over 90d

Papers · 30d

8 over 90d

TIER MIX · 90D

TOPICS

SENTIMENT · 30D

2 day(s) with sentiment data

RECENT · PAGE 1/1 · 8 TOTAL

TOOL · CL_100180 · Jun 19 · 04:00

New dataset Insulin4RL enables offline reinforcement learning with irregular clinical data

Researchers have introduced Insulin4RL, a new dataset designed for offline reinforcement learning in healthcare settings. This dataset, derived from MIMIC-IV, contains over 375,000 decisions from 12,209 intensive care u…
TOOL · CL_80057 · Jun 9 · 04:00

New framework refines offline RL trajectories using counterfactual flows

Researchers have introduced a new framework called counterfactual transport flows for offline reinforcement learning. This method aims to improve decision-making policies using only logged historical data, without extra…
TOOL · CL_79775 · Jun 9 · 04:00

New benchmark standardizes offline RL for nuclear fusion plasma control

Researchers have introduced RL4F, a new benchmark designed to standardize the evaluation of offline reinforcement learning for plasma control in nuclear fusion. This benchmark utilizes historical data from the DIII-D to…
TOOL · CL_58992 · May 29 · 04:00

New TrojanTO attack targets trajectory optimization models in RL

Researchers have developed TrojanTO, a novel method for executing action-level backdoor attacks against trajectory optimization (TO) models used in offline reinforcement learning. Unlike previous reward-manipulation att…
RESEARCH · CL_29303 · May 12 · 17:05

New bootstrap method enhances offline reinforcement learning analysis

Researchers have developed a new model-based bootstrap method for controlled Markov chains, particularly useful in offline reinforcement learning scenarios where the data-generating policy is unknown. This technique est…
TOOL · CL_21970 · May 8 · 04:00

New ME-AM framework enhances offline RL with entropy maximization

Researchers have introduced Maximum Entropy Adjoint Matching (ME-AM), a new framework designed to improve offline reinforcement learning. This method addresses limitations in existing approaches, such as popularity bias…
RESEARCH · CL_21748 · May 7 · 16:58

New Q-Ising method optimizes dynamic treatment allocation on networks

Researchers have developed Q-Ising, a novel three-stage pipeline for dynamic treatment allocation in networks. This method integrates network structure with dynamic treatment strategies, addressing limitations of existi…
TOOL · CL_16081 · May 5 · 04:00

New AdamO optimizer enhances stability and performance in offline RL

Researchers have introduced AdamO, a novel optimizer designed to enhance stability in offline reinforcement learning. This new optimizer addresses the issue of 'collapse,' where errors in temporal-difference updates can…

New dataset Insulin4RL enables offline reinforcement learning with irregular clinical data

New framework refines offline RL trajectories using counterfactual flows

New benchmark standardizes offline RL for nuclear fusion plasma control

New TrojanTO attack targets trajectory optimization models in RL

New bootstrap method enhances offline reinforcement learning analysis

New ME-AM framework enhances offline RL with entropy maximization

New Q-Ising method optimizes dynamic treatment allocation on networks

New AdamO optimizer enhances stability and performance in offline RL