ENTITY Policy Evaluation

PulseAugur coverage of Policy Evaluation — every cluster mentioning Policy Evaluation across labs, papers, and developer communities, ranked by signal.

Total: 2 over 90d
Releases: 0 over 90d
Papers: 2 over 90d

SENTIMENT · 30D: 1 day with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL

RESEARCH · CL_109497 · Jun 23 · 21:02

New minimax PAC bounds for learning in exogenous contextual MDPs

Researchers have developed new minimax PAC bounds for learning in exogenous contextual Markov decision processes (MDPs). The study focuses on tabular discounted MDPs with exogenous, i.i.d. contexts that can influence re…

RESEARCH · CL_06881 · Apr 28 · 04:00

New research explores Bellman residual minimization for control tasks in reinforcement learning

This paper introduces foundational results for Bellman residual minimization applied to policy optimization in Markov decision problems. While dynamic programming is more common, Bellman residual minimization offers adv…
