实体
Advantage Actor-Critic
Advantage Actor-Critic
PulseAugur coverage of Advantage Actor-Critic — every cluster mentioning Advantage Actor-Critic across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
-
集成强化学习模型增强金融交易策略
研究人员开发了一种用于金融交易的集成强化学习(RL)方法,将A2C、PPO和SAC等RL算法与SVM、决策树和逻辑回归等传统分类器相结合。这种混合方法旨在改善风险-回报权衡并减少与独立RL模型相比的跌幅。研究发现,集成策略的性能始终优于单个模型,尽管性能对方差阈值参数\(\tau\)敏感,这表明需要动态调整。
-
Multi-agent RL ensures drone fleet separation but may favor stronger configurations
Researchers have developed a multi-agent reinforcement learning framework to ensure safe separation between fleets of small unmanned aerial systems (sUASs). The proposed attention-enhanced Proximal Policy Optimization-b…