Frictional Q-Learning algorithm enhances reinforcement learning stability and performance

By PulseAugur Editorial · [1 source] · 2026-05-08 04:00

Researchers have introduced Frictional Q-Learning, an off-policy reinforcement learning algorithm designed to reduce extrapolation errors, which arise when a learned policy selects actions that are weakly supported in the replay buffer. Drawing an analogy to static friction, the method models the replay buffer as a low-dimensional manifold and identifies supported actions as tangent directions. It encodes these supported actions with a contrastive variational autoencoder, which is reported to give more stable and robust performance on continuous-control benchmarks than existing methods.
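To make the idea of a "supported" action concrete, here is a minimal Python sketch. It is not the paper's method: instead of a contrastive variational autoencoder or tangent directions, it swaps in a plain nearest-neighbor distance to the replay buffer as a rough proxy for support. The function names, the toy behavior policy, and the threshold are all assumptions made for illustration.

```python
import math
import random


def support_distance(state, action, buffer, k=3):
    """Mean Euclidean distance from (state, action) to its k nearest buffer pairs.

    Hypothetical proxy for "support": small values mean the buffer contains
    similar state-action pairs, large values mean the pair is an extrapolation.
    """
    query = list(state) + list(action)
    dists = sorted(math.dist(query, list(s) + list(a)) for s, a in buffer)
    return sum(dists[:k]) / k


def filter_supported(state, candidates, buffer, threshold, k=3):
    """Keep only candidate actions whose support distance is within threshold."""
    return [a for a in candidates
            if support_distance(state, a, buffer, k) <= threshold]


# Toy replay buffer: the (assumed) behavior policy took actions near a = 0.5 * s,
# so the stored pairs lie close to a one-dimensional curve in state-action space.
rng = random.Random(0)
buffer = []
for _ in range(200):
    s = rng.uniform(-1.0, 1.0)
    a = 0.5 * s + rng.gauss(0.0, 0.02)
    buffer.append(((s,), (a,)))

state = (0.4,)
candidates = [(0.2,), (0.9,)]  # the first lies on the data curve, the second far from it
supported = filter_supported(state, candidates, buffer, threshold=0.1)
print(supported)
```

In this toy setup, a Q-function trained only on the buffer has never seen anything resembling the second candidate, so its value estimate there would be pure extrapolation. Restricting action selection to the filtered list is one simple way to avoid querying such estimates.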

IMPACT Introduces a novel method to improve stability and robustness in off-policy reinforcement learning, potentially enhancing performance in complex control tasks.

RANK_REASON This is a research paper detailing a new algorithm for reinforcement learning.



AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

arXiv cs.LG TIER_1 English(EN) · Hyunwoo Kim, Hyo Kyung Lee · 2026-05-08 04:00

Frictional Q-Learning

arXiv:2509.19771v4 · Abstract: Off-policy reinforcement learning suffers from extrapolation errors when a learned policy selects actions that are weakly supported in the replay buffer. In this study, we address this issue by drawing an analogy to static frict…

