PulseAugur
LIVE 12:21:52
ENTITY Sanjeev Manivannan

Sanjeev Manivannan

PulseAugur coverage of Sanjeev Manivannan — every cluster mentioning Sanjeev Manivannan across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_32664 ·

    New RL method uses policy Hessian for faster convergence

    Researchers have developed a novel second-order actor-critic method for reinforcement learning in discounted Markov Decision Processes. This approach aims to accelerate convergence by incorporating curvature information…