PulseAugur
EN
LIVE 12:09:38
ENTITY Q-learning

Q-learning

PulseAugur coverage of Q-learning — every cluster mentioning Q-learning across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
23
23 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
22
22 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

7 day(s) with sentiment data

RECENT · PAGE 1/2 · 23 TOTAL
  1. RESEARCH · CL_111228 ·

    New Heavy-Ball Q-Learning method promises faster reinforcement learning convergence

    Researchers have introduced a novel Heavy-Ball Q-Learning method designed to enhance reinforcement learning algorithms. This new approach establishes convergence guarantees and identifies conditions under which it can t…

  2. RESEARCH · CL_99555 ·

    New robust Q-learning algorithm tackles mean-field control with Wasserstein uncertainty

    Researchers have developed a new robust Q-learning algorithm designed for mean-field control problems. This algorithm addresses challenges posed by Wasserstein uncertainty in common noise laws by integrating a quantizat…

  3. RESEARCH · CL_97808 ·

    Quantum Annealing boosts AI for predictive maintenance · 2 sources tracked

    Researchers have developed a novel Quantum Annealing enhanced Q-Learning (QAQL) framework to improve Remaining Useful Lifetime (RUL) prediction in predictive maintenance. This approach integrates quantum annealing's sam…

  4. TOOL · CL_93859 ·

    New Q-Learning Algorithms Offer Fine-Grained Regret Bounds

    Researchers have developed new algorithms for Q-learning that provide more precise regret bounds in episodic tabular Markov Decision Processes. These advancements address limitations in existing methods by offering fine…

  5. RESEARCH · CL_93397 ·

    New theory advances Q-learning in continuous stochastic control

    Researchers have published a paper on arXiv detailing a theoretical advancement in Q-learning, a fundamental algorithm in reinforcement learning. The study focuses on the mathematical underpinnings of Q-learning within …

  6. RESEARCH · CL_82029 ·

    New Q-Learning Method Enhances Stability with Geometric Target Updates

    Researchers have introduced a new method called the $\lambda$-target update for linear Q-learning, which averages periodic target updates with geometric weights. This technique aims to improve the stability of Q-learnin…

  7. TOOL · CL_80125 ·

    New algorithm BLINQ learns Whittle indices for Markov Decision Processes

    Researchers have developed BLINQ, a novel model-based algorithm designed to learn Whittle indices for Markov Decision Processes. This new approach constructs an empirical estimate of the MDP and then computes the indice…

  8. RESEARCH · CL_65476 ·

    New research explores Q-learning stability and offline RL methods

    Two new research papers explore advancements in reinforcement learning techniques. One paper introduces Drift Q-Learning, a method that combines a drift-based behavioral regularizer with critic-driven policy improvement…

  9. RESEARCH · CL_62198 ·

    Lyapunov framework analyzes stochastic algorithm convergence

    Researchers have published a paper detailing a Lyapunov-based framework for analyzing the finite-time convergence of stochastic iterative algorithms. This approach uses generalized Moreau envelopes as universal Lyapunov…

  10. RESEARCH · CL_62182 ·

    Delayed regulation destabilizes adaptive AI agents, study finds

    A new research paper explores how delays in regulatory intervention can destabilize adaptive multi-agent systems. The study found that reactive agents, which immediately respond to signals, are highly susceptible to ins…

  11. TOOL · CL_53753 ·

    Q-learning integration boosts offline In-Context RL performance

    A new research paper explores the effectiveness of integrating Reinforcement Learning (RL) objectives into offline In-Context Reinforcement Learning (ICRL) methods. Experiments across over 150 datasets in GridWorld and …

  12. TOOL · CL_51393 ·

    New RL policies boost high-frequency trading performance

    Researchers have developed new reinforcement learning policies for high-frequency trading on limit order books. Their approach utilizes Order-Flow signals as a state representation and employs policy-gradient methods, s…

  13. RESEARCH · CL_50645 ·

    New WA* framework achieves zero-shot generalization in AI planning

    Researchers have developed a novel self-improving planning framework called WA* that combines a value heuristic represented by a Relational Graph Neural Network with Q-learning. This approach guides search and uses the …

  14. TOOL · CL_44960 ·

    New Q-learning algorithm robust to corrupted rewards

    Researchers have developed a new variant of Q-learning designed to handle adversarially corrupted rewards in reinforcement learning settings. This novel algorithm is analyzed under asynchronous sampling conditions and p…

  15. RESEARCH · CL_38193 ·

    New Q-learning method achieves n^{-1/4} Gaussian approximation bound

    Researchers have developed a new method for approximating Gaussian distributions in entropy-regularized Q-learning with function approximation. The study establishes convergence rates for averaged iterates generated by …

  16. TOOL · CL_36955 ·

    Q-Learning Error Analysis Reveals Overestimation Dynamics

    Researchers have developed a novel finite-time error analysis for Q-learning algorithms using constant step sizes. The analysis decomposes the error into negative and positive components, revealing that the negative par…

  17. TOOL · CL_36609 ·

    Q-learning agent mimics insect behavior for odor source detection

    Researchers have developed a Q-learning agent capable of navigating turbulent flows to find odor sources, utilizing a minimal memory of the time elapsed since the last scent detection. This agent successfully learned st…

  18. TOOL · CL_36613 ·

    LLM and Q-learning enhance cloud intrusion detection system

    Researchers have developed a novel multi-layer intrusion detection system (IDS) for cloud environments that integrates large language models (LLMs) and adaptive Q-learning. This system operates across network, host, and…

  19. TOOL · CL_22473 ·

    New Long-Horizon Q-Learning method improves reinforcement learning accuracy

    Researchers have introduced Long-Horizon Q-Learning (LQL), a novel method designed to improve the stability of value-based reinforcement learning. LQL addresses the issue of compounding estimation errors in traditional …

  20. TOOL · CL_21970 ·

    New ME-AM framework enhances offline RL with entropy maximization

    Researchers have introduced Maximum Entropy Adjoint Matching (ME-AM), a new framework designed to improve offline reinforcement learning. This method addresses limitations in existing approaches, such as popularity bias…