实体
Overcooked
Overcooked
PulseAugur coverage of Overcooked — every cluster mentioning Overcooked across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
最近 · 第 1/1 页 · 共 2 条
-
新研究表明高熵导致Dec-POMDP中的对称等变策略
一篇新论文探讨了高熵正则化如何在分布式部分可观察马尔可夫决策过程(Dec-POMDPs)中产生对称等变策略。研究表明,足够高的熵可以确保策略梯度流在不同初始化下收敛到兼容的联合策略。在Hanabi和Overcooked等环境中的实证测试表明,增加熵系数会显著影响跨局回报,并且在训练后通过贪婪化策略有改进的潜力。
-
Researchers develop zero-shot coordination for multi-agent AI with diverse reward shapings
Researchers have developed a new method for Zero-Shot Coordination (ZSC) in multi-agent reinforcement learning, enabling agents to cooperate effectively with unknown partners even when reward signals are shaped differen…