Brief · PulseAugur

TOOL · arXiv cs.LG English(EN) · 1mo

Actor-Critic Algorithm for Dynamic Expectile and CVaR

Researchers have developed a new actor-critic algorithm designed to optimize dynamic risk management in stochastic policies. This novel approach bypasses the need for transition perturbation in policy updates and utilizes model-free value learning for dynamic expectile and conditional value-at-risk. The algorithm has demonstrated superior performance in learning risk-averse policies through empirical testing in domains exhibiting verifiable risk-averse behavior. AI

IMPACT Introduces a new algorithmic approach for optimizing risk-averse policies in machine learning applications.

Actor-Critic Algorithm
Dynamic Expectile