Researchers develop continuous-time q-learning for mean-field control with common noise

By PulseAugur Editorial · [2 sources] · 2026-05-01 04:00

This two-part paper introduces theoretical foundations and algorithms for continuous-time Q-learning in mean-field control with common noise. Part I establishes the theoretical framework, defining an integrated Q-function (Iq-function) and deriving conditions for optimal policies as fixed points. Part II builds upon this by devising Q-learning algorithms, including an Actor-Critic approach, and demonstrating their convergence and performance in linear-quadratic and other settings. AI

IMPACT Introduces novel Q-learning algorithms for complex control problems, potentially advancing reinforcement learning applications in multi-agent systems.

RANK_REASON This is a research paper published on arXiv detailing theoretical foundations and algorithms for a specific type of control problem.

Read on arXiv cs.LG →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

Researchers develop continuous-time q-learning for mean-field control with common noise

COVERAGE [2]

arXiv cs.LG TIER_1 English(EN) · Zhenjie Ren, Xiaoli Wei, Xiang Yu, Xun Yu Zhou · 2026-05-01 04:00

Continuous-time q-learning for mean-field control with common noise, part-I: Theoretical foundations

arXiv:2604.27372v1 Announce Type: cross Abstract: This paper investigates the continuous-time counterpart of the Q-function for entropy-regularized mean-field control (MFC) with controlled common noise, coined as q-function by Jia and Zhou (2023) in the single agent's model. We f…
arXiv cs.LG TIER_1 English(EN) · Zhenjie Ren, Xiaoli Wei, Xiang Yu, Xun Yu Zhou · 2026-05-01 04:00

Continuous-time q-learning for mean-field control with common noise, part-II: q-learning algorithms

arXiv:2604.27378v1 Announce Type: cross Abstract: This paper is a continuation work of Ren et al. (2026) aiming to further devise q-learning algorithms for mean-field control (MFC) with controlled common noise. Based on the relaxed control formulation, we first establish the mart…

COVERAGE [2]

Continuous-time q-learning for mean-field control with common noise, part-I: Theoretical foundations

Continuous-time q-learning for mean-field control with common noise, part-II: q-learning algorithms

RELATED ENTITIES

RELATED TOPICS