English(EN) Evolution Strategies

OpenAI发现进化策略在AI训练方面可与强化学习相媲美

作者 PulseAugur 编辑部 · [2 个来源] · 2017-03-24 07:00

OpenAI的研究人员发现，进化策略（ES）这一已有数十年的优化技术，在Atari和MuJoCo等基准测试中，其性能可以与现代强化学习（RL）方法相媲美。ES的优势包括实现更简单，无需反向传播，在分布式环境中更易于扩展，以及能更好地处理稀疏奖励。这种方法比传统的RL训练智能体速度显著更快，一项实验将人形步行者（humanoid walker）的训练时间从10小时缩短到10分钟。 AI

排序理由这是来自OpenAI的一篇研究论文，详细介绍了将一种已知优化技术新颖应用于AI基准测试。

在 Lil'Log (Lilian Weng) 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

OpenAI News TIER_1 English(EN) · 2017-03-24 07:00

进化策略作为强化学习的可扩展替代方案

We’ve discovered that evolution strategies (ES), an optimization technique that’s been known for decades, rivals the performance of standard reinforcement learning (RL) techniques on modern RL benchmarks (e.g. Atari/MuJoCo), while overcoming many of RL’s inconveniences.
Lil'Log (Lilian Weng) TIER_1 English(EN) · 2019-09-05 00:00

Evolution Strategies

<!-- Gradient descent is not the only option when learning optimal model parameters. Evolution Strategies (ES) works out well in the cases where we don't know the precise analytic form of an objective function or cannot compute the gradients directly. This post dives into several…

报道来源 [2]

进化策略作为强化学习的可扩展替代方案

Evolution Strategies

相关实体

相关话题