PulseAugur
EN
LIVE 15:47:08
ENTITY Policy Evaluation

Policy Evaluation

PulseAugur coverage of Policy Evaluation — every cluster mentioning Policy Evaluation across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 2 TOTAL
  1. RESEARCH · CL_109497 ·

    New minimax PAC bounds for learning in exogenous contextual MDPs

    Researchers have developed new minimax PAC bounds for learning in exogenous contextual Markov decision processes (MDPs). The study focuses on tabular discounted MDPs with exogenous, i.i.d. contexts that can influence re…

  2. RESEARCH · CL_06881 ·

    New research explores Bellman residual minimization for control tasks in reinforcement learning

    This paper introduces foundational results for Bellman residual minimization applied to policy optimization in Markov decision problems. While dynamic programming is more common, Bellman residual minimization offers adv…