D4RL

PulseAugur coverage of D4RL — every cluster mentioning D4RL across labs, papers, and developer communities, ranked by signal.

Total · 30d: 9 (9 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 9 (9 over 90d)
Sentiment · 30d: 4 days with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL
  1. TOOL · CL_110190

    New ROMI method advances offline reinforcement learning, outperforming prior models

    Researchers have introduced ROMI, a novel method for model-based offline reinforcement learning that addresses key challenges in adversarial model learning. Unlike previous approaches like RAMBO, which struggled with co…

  2. RESEARCH · CL_91432

    New research enhances diffusion models for robust RL and safe planning

    Researchers are developing new methods to improve the robustness and safety of diffusion models in reinforcement learning and planning tasks. One approach, Robust Regularized Policy Iteration (RRPI), addresses transitio…

  3. TOOL · CL_82614

    New MPDiffuser framework enhances diffusion model control for robotics

    Researchers have developed a new framework called Model Predictive Diffuser (MPDiffuser) to improve the reliability of diffusion models in offline decision-making tasks. This approach combines a diffusion planner with a…

  4. RESEARCH · CL_65476

    New research explores Q-learning stability and offline RL methods

    Two new research papers explore advancements in reinforcement learning techniques. One paper introduces Drift Q-Learning, a method that combines a drift-based behavioral regularizer with critic-driven policy improvement…

  5. TOOL · CL_58899

    New MoMa QL framework boosts RL efficiency with moment matching

    Researchers have introduced Moment Matching Q-Learning (MoMa QL), a novel framework designed to address the inference latency issues in score-based and flow-based generative models used in reinforcement learning. MoMa Q…

  6. TOOL · CL_56177

    New SPAR framework advances offline policy improvement

    Researchers have introduced Support-Preserving Action Rectification (SPAR), a novel framework designed to address the inherent conflict in offline policy improvement. SPAR reframes global learning as a local residual re…

  7. RESEARCH · CL_50951

    New research advances policy optimization for robotics and LLMs

    Researchers have introduced several new methods to enhance policy optimization in reinforcement learning, particularly for complex tasks involving robotics and large language models. MODIP aims to efficiently fine-tune …

  8. TOOL · CL_38233

    New COOPO framework boosts reinforcement learning efficiency

    Researchers have developed a new framework called COOPO (Cyclic Offline-Online Policy Optimization) to address limitations in offline and online reinforcement learning. This method repeatedly cycles between offline trai…

  9. TOOL · CL_21965

    SlimDT paper proposes injecting RTG outside sequential modeling

    Researchers have developed SlimDT, a modification of the Decision Transformer (DT) model for offline reinforcement learning. SlimDT removes the Return-to-Go (RTG) token from the autoregressive sequence, instead injectin…