research · [2 sources] · 2026-05-17 22:23

New Q-learning method achieves n^{-1/4} Gaussian approximation bound

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

Researchers have developed a new method for approximating Gaussian distributions in entropy-regularized Q-learning with function approximation. The study establishes convergence rates for averaged iterates generated by asynchronous Q-learning, achieving a Gaussian approximation bound with a rate of order n^{-1/4}. This work combines linearization of the soft Bellman recursion with a Gaussian approximation for the leading martingale term, also deriving high-order moment bounds for the algorithm's final iterate. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Establishes theoretical bounds for Q-learning algorithms, potentially improving sample efficiency in reinforcement learning applications.

RANK_REASON The cluster contains an academic paper detailing a new theoretical result in machine learning.

Read on arXiv stat.ML →

paper
other

$New Q-learning method achieves n^{-1/4} Gaussian approximation bound$

COVERAGE [2]

arXiv stat.ML TIER_1 · Artemy Rubtsov, Rahul Singh, Eric Moulines, Alexey Naumov, Sergey Samsonov · 2026-05-19 04:00

On Gaussian approximation for entropy-regularized Q-learning with function approximation

arXiv:2605.17678v1 Announce Type: new Abstract: In this paper, we derive rates of convergence in the high-dimensional central limit theorem for Polyak--Ruppert averaged iterates generated by entropy-regularized asynchronous Q-learning with linear function approximation and a poly…
arXiv stat.ML TIER_1 · Sergey Samsonov · 2026-05-17 22:23

On Gaussian approximation for entropy-regularized Q-learning with function approximation

In this paper, we derive rates of convergence in the high-dimensional central limit theorem for Polyak--Ruppert averaged iterates generated by entropy-regularized asynchronous Q-learning with linear function approximation and a polynomial stepsize $k^{-ω}$, $ω\in (1/2,1)$. Assumi…

COVERAGE [2]

On Gaussian approximation for entropy-regularized Q-learning with function approximation

On Gaussian approximation for entropy-regularized Q-learning with function approximation

RELATED ENTITIES

RELATED TOPICS