PulseAugur
实时 09:04:00
实体 Donghwan Lee

Donghwan Lee

PulseAugur coverage of Donghwan Lee — every cluster mentioning Donghwan Lee across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
  1. TOOL · CL_16258 ·

    New Q-learning theory offers tighter convergence rate analysis

    Researchers have developed a novel theoretical framework for analyzing Q-learning, a fundamental algorithm in reinforcement learning. This new approach views Q-learning through the lens of switching systems, deriving a …

  2. RESEARCH · CL_06881 ·

    New research explores Bellman residual minimization for control tasks in reinforcement learning

    This paper introduces foundational results for Bellman residual minimization applied to policy optimization in Markov decision problems. While dynamic programming is more common, Bellman residual minimization offers adv…