PulseAugur

Offline Reinforcement Learning

PulseAugur coverage of Offline Reinforcement Learning: every cluster mentioning the topic across labs, papers, and developer communities, ranked by signal.

Total · 30 days: 4 (90 days: 4)
Releases · 30 days: 0 (90 days: 0)
Papers · 30 days: 4 (90 days: 4)
Tier distribution · 90 days
Recent · Page 1 of 1 · 4 items
  1. RESEARCH · CL_29303 ·

    New bootstrap method enhances offline reinforcement learning analysis

    Researchers have developed a new model-based bootstrap method for controlled Markov chains, particularly useful in offline reinforcement learning scenarios where the data-generating policy is unknown. This technique est…

  2. TOOL · CL_21970 ·

    New ME-AM framework enhances offline RL with entropy maximization

    Researchers have introduced Maximum Entropy Adjoint Matching (ME-AM), a new framework designed to improve offline reinforcement learning. This method addresses limitations in existing approaches, such as popularity bias…

  3. RESEARCH · CL_21748 ·

    New Q-Ising method optimizes dynamic treatment allocation on networks

    Researchers have developed Q-Ising, a novel three-stage pipeline for dynamic treatment allocation in networks. This method integrates network structure with dynamic treatment strategies, addressing limitations of existi…

  4. TOOL · CL_16081 ·

    New AdamO optimizer enhances stability and performance in offline RL

    Researchers have introduced AdamO, a novel optimizer designed to enhance stability in offline reinforcement learning. This new optimizer addresses the issue of 'collapse,' where errors in temporal-difference updates can…