PulseAugur
LIVE 18:55:33
ENTITY Temporal Difference (TD) learning

Temporal Difference (TD) learning

PulseAugur coverage of Temporal Difference (TD) learning — every cluster mentioning Temporal Difference (TD) learning across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_43579 ·

    New bounds enhance statistical inference for Reinforcement Learning

    Researchers have developed new high-dimensional concentration inequalities and Berry-Esseen bounds for martingales induced by Markov chains. These findings are applied to analyze Temporal Difference (TD) learning with l…