PulseAugur
LIVE 13:22:11
research · [2 sources] ·

New Q-learning method achieves n^{-1/4} Gaussian approximation bound

Researchers have developed a new method for approximating Gaussian distributions in entropy-regularized Q-learning with function approximation. The study establishes convergence rates for averaged iterates generated by asynchronous Q-learning, achieving a Gaussian approximation bound with a rate of order n^{-1/4}. This work combines linearization of the soft Bellman recursion with a Gaussian approximation for the leading martingale term, also deriving high-order moment bounds for the algorithm's final iterate. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Establishes theoretical bounds for Q-learning algorithms, potentially improving sample efficiency in reinforcement learning applications.

RANK_REASON The cluster contains an academic paper detailing a new theoretical result in machine learning.

Read on arXiv stat.ML →

New Q-learning method achieves n^{-1/4} Gaussian approximation bound

COVERAGE [2]

  1. arXiv stat.ML TIER_1 · Artemy Rubtsov, Rahul Singh, Eric Moulines, Alexey Naumov, Sergey Samsonov ·

    On Gaussian approximation for entropy-regularized Q-learning with function approximation

    arXiv:2605.17678v1 Announce Type: new Abstract: In this paper, we derive rates of convergence in the high-dimensional central limit theorem for Polyak--Ruppert averaged iterates generated by entropy-regularized asynchronous Q-learning with linear function approximation and a poly…

  2. arXiv stat.ML TIER_1 · Sergey Samsonov ·

    On Gaussian approximation for entropy-regularized Q-learning with function approximation

    In this paper, we derive rates of convergence in the high-dimensional central limit theorem for Polyak--Ruppert averaged iterates generated by entropy-regularized asynchronous Q-learning with linear function approximation and a polynomial stepsize $k^{-ω}$, $ω\in (1/2,1)$. Assumi…