D4RL

PulseAugur coverage of D4RL — every cluster mentioning D4RL across labs, papers, and developer communities, ranked by signal.

Total · 30d: 9 (9 over 90d)

Releases · 30d: 0 (0 over 90d)

Papers · 30d: 9 (9 over 90d)

SENTIMENT · 30D: 4 days with sentiment data

RECENT · PAGE 1/1 · 9 TOTAL

TOOL · CL_110190 · Jun 25 · 06:42

New ROMI method advances offline reinforcement learning, outperforming prior models

Researchers have introduced ROMI, a novel method for model-based offline reinforcement learning that addresses key challenges in adversarial model learning. Unlike previous approaches like RAMBO, which struggled with co…
RESEARCH · CL_91432 · Jun 15 · 04:00

New research enhances diffusion models for robust RL and safe planning

Researchers are developing new methods to improve the robustness and safety of diffusion models in reinforcement learning and planning tasks. One approach, Robust Regularized Policy Iteration (RRPI), addresses transitio…
TOOL · CL_82614 · Jun 10 · 04:00

New MPDiffuser framework enhances diffusion model control for robotics

Researchers have developed a new framework called Model Predictive Diffuser (MPDiffuser) to improve the reliability of diffusion models in offline decision-making tasks. This approach combines a diffusion planner with a…
RESEARCH · CL_65476 · May 31 · 15:46

New research explores Q-learning stability and offline RL methods

Two new research papers explore advancements in reinforcement learning techniques. One paper introduces Drift Q-Learning, a method that combines a drift-based behavioral regularizer with critic-driven policy improvement…
TOOL · CL_58899 · May 29 · 04:00

New MoMa QL framework boosts RL efficiency with moment matching

Researchers have introduced Moment Matching Q-Learning (MoMa QL), a novel framework designed to address the inference latency issues in score-based and flow-based generative models used in reinforcement learning. MoMa Q…
TOOL · CL_56177 · May 28 · 04:00

New SPAR framework advances offline policy improvement in AI

Researchers have introduced Support-Preserving Action Rectification (SPAR), a novel framework designed to address the inherent conflict in offline policy improvement. SPAR reframes global learning as a local residual re…
RESEARCH · CL_50951 · May 26 · 04:00

New research advances policy optimization for robotics and LLMs

Researchers have introduced several new methods to enhance policy optimization in reinforcement learning, particularly for complex tasks involving robotics and large language models. MODIP aims to efficiently fine-tune …
TOOL · CL_38233 · May 18 · 17:15

New COOPO framework boosts reinforcement learning efficiency

Researchers have developed a new framework called COOPO (Cyclic Offline-Online Policy Optimization) to address limitations in offline and online reinforcement learning. This method repeatedly cycles between offline trai…
TOOL · CL_21965 · May 8 · 04:00

SlimDT paper proposes injecting RTG outside sequential modeling

Researchers have developed SlimDT, a modification of the Decision Transformer (DT) model for offline reinforcement learning. SlimDT removes the Return-to-Go (RTG) token from the autoregressive sequence, instead injectin…