ENTITY
Policy Evaluation
Policy Evaluation
PulseAugur coverage of Policy Evaluation — every cluster mentioning Policy Evaluation across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D
1 day(s) with sentiment data
RECENT · PAGE 1/1 · 2 TOTAL
-
New minimax PAC bounds for learning in exogenous contextual MDPs
Researchers have developed new minimax PAC bounds for learning in exogenous contextual Markov decision processes (MDPs). The study focuses on tabular discounted MDPs with exogenous, i.i.d. contexts that can influence re…
-
New research explores Bellman residual minimization for control tasks in reinforcement learning
This paper introduces foundational results for Bellman residual minimization applied to policy optimization in Markov decision problems. While dynamic programming is more common, Bellman residual minimization offers adv…