reinforcement learning
PulseAugur coverage of reinforcement learning — every cluster mentioning reinforcement learning across labs, papers, and developer communities, ranked by signal.
- instance of SOFT ACTOR-CRITIC REINFORCEMENT LEARNING FOR ROBOTIC MANIPULATOR WITH HINDSIGHT EXPERIENCE REPLAY 95%
- used by large-language models 90%
- used by Grpo 90%
- used by Markov decision process 90%
- used by large language model 90%
- used by Soft Actor--Critic 90%
- developed by large-language models 70%
- developed by Grpo 70%
- used by robotics 70%
- used by supervised fine-tuning 70%
- used by Group Relative Policy Optimization 70%
- employs Diffusion Models 70%
- 2026-05-18 research_milestone A new paper proposes a reinforcement learning framework for modeling customer trajectories in retail. source
26 day(s) with sentiment data
-
AI's adaptive control debated: self-learning vs. human rules
This cluster discusses the concept of adaptive control in AI, specifically focusing on reinforcement learning. It poses the question of whether machines should autonomously learn and adapt or adhere strictly to human-de…
-
New RL policy enables scalable, persona-driven NPCs in games
Researchers have developed a novel reinforcement learning policy called pcsp, designed to enable scalable and controllable non-player characters (NPCs) in life-simulation games. This single policy is conditioned on LLM …
-
RL framework automates security protocol analysis in Tamarin
Researchers have developed a reinforcement learning (RL) framework to automate and shorten the process of analyzing security protocols using the Tamarin tool. This new method, inspired by AlphaZero, employs a neural heu…
-
New RL framework generates precisely constrained graphs
Researchers have developed a new reinforcement learning framework called the Deep Microcanonical Graph Generator (DMGG) to create graphs with precisely controlled structural properties. This method allows for exact enfo…
-
CellFluxRL uses RL to create biologically accurate virtual cells
Researchers have developed CellFluxRL, a novel framework for creating virtual cells that adhere to biological and physical constraints. This approach uses reinforcement learning with biologically meaningful reward funct…
-
New framework tests AI vs traditional network congestion controllers
Researchers have developed CCLab, a new framework designed to test the robustness of network congestion controllers, including both learning-based and traditional algorithms. The framework uses a reinforcement learning …
-
TimeRewarder learns dense rewards from passive videos for RL
Researchers have developed TimeRewarder, a novel method for learning dense reward signals from passive videos. This technique models temporal distances between frame pairs to estimate task progress, which can then guide…
-
New bounds enhance statistical inference for Reinforcement Learning
Researchers have developed new high-dimensional concentration inequalities and Berry-Esseen bounds for martingales induced by Markov chains. These findings are applied to analyze Temporal Difference (TD) learning with l…
-
Study shows training data curriculum fine-tunes RL agent specialization
A new study on arXiv explores how different training data curricula impact the performance of reinforcement learning (RL) agents designed to work with large language models (LLMs) and external memory banks. The research…
-
Reinforcement learning optimizes ion shuttling for quantum computers
Researchers have developed a novel reinforcement learning (RL) approach to optimize ion shuttling on trapped-ion quantum computers. This method addresses the high-dimensional optimization challenge that arises with incr…
-
AI enhances serious games with adaptive learning and dynamic scenarios
A new chapter explores the integration of artificial intelligence into serious games, aiming to overcome limitations like static scenarios and authoring bottlenecks. It discusses how AI, including LLMs and reinforcement…
-
New Mem-π Framework Enhances LLM Agent Memory with Dynamic Guidance Generation
Researchers have developed Mem-π, a novel framework designed to enhance the adaptive memory capabilities of large language model (LLM) agents. Unlike traditional methods that rely on static retrieval from memory banks, …
-
Robot Tactile Olympiad benchmark accelerates blind manipulation tasks
Researchers have introduced roto 2.0, a new benchmark for tactile-based reinforcement learning in robotics. This benchmark utilizes GPU parallelism and focuses on end-to-end "blind" manipulation tasks across four differ…
-
Reinforcement learning optimizes urban street design and traffic signals
Researchers have developed DeCoR, a novel reinforcement learning framework designed to optimize urban street design and traffic signal control. The system first learns to generate optimal crosswalk layouts by encoding p…
-
New RL policies boost efficiency with one-step generative control
Researchers have developed new methods for reinforcement learning policies that aim to improve efficiency and expressiveness. One approach, Score-Based One-step MeanFlow Policy Optimization (SOM), constructs a target ve…
-
PREFINE method enhances AI safety alignment using preference tuning
Researchers have developed PREFINE, a novel method for adapting pre-trained reinforcement learning policies to incorporate safety constraints without full retraining. This technique leverages trajectory-level preference…
-
Quantum RL advances VQA state prep and process synthesis
Researchers have developed a new framework called CRiSP that uses reinforcement learning and Transformer-based policies to improve the initial state preparation for Variational Quantum Algorithms (VQAs). This method aim…
-
New YANN-RL method speeds up AI control for chemical processes
Researchers have developed a new reinforcement learning (RL) approach called Y-wise Affine Neural Network (YANN-RL) designed for control in chemical process systems. This method aims to overcome the typical challenges o…
-
AI research advances autonomous driving safety with new RL frameworks
Two new research papers explore advanced reinforcement learning techniques for safer autonomous driving. The first paper introduces a multi-agent reinforcement learning (MARL) approach where self-driving cars and pedest…
-
New PG-DPO framework enhances reinforcement learning for non-exponential discounting
Researchers have developed a new framework called Pontryagin-Guided Direct Policy Optimization (PG-DPO) to address limitations in reinforcement learning methods. Traditional approaches using Bellman-style recursions str…