Entity
Donghwan Lee
PulseAugur coverage of Donghwan Lee — every cluster mentioning Donghwan Lee across labs, papers, and developer communities, ranked by signal.
Total · 30 days
2
2 in 90 days
Releases · 30 days
0
0 in 90 days
Papers · 30 days
2
2 in 90 days
Tier distribution · 90 days
Recent · Page 1 of 1 · 2 items
- New Q-learning theory offers tighter convergence rate analysis
Researchers have developed a novel theoretical framework for analyzing Q-learning, a fundamental algorithm in reinforcement learning. This new approach views Q-learning through the lens of switching systems, deriving a …
- New research explores Bellman residual minimization for control tasks in reinforcement learning
This paper introduces foundational results for Bellman residual minimization applied to policy optimization in Markov decision problems. While dynamic programming is more common, Bellman residual minimization offers adv…