PulseAugur

best-policy extraction

PulseAugur coverage of best-policy extraction — every cluster mentioning best-policy extraction across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
SENTIMENT · 30D: 1 day with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_109497

    New minimax PAC bounds for learning in exogenous contextual MDPs

    Researchers have developed new minimax PAC bounds for learning in exogenous contextual Markov decision processes (MDPs). The study focuses on tabular discounted MDPs with exogenous, i.i.d. contexts that can influence re…