Dec-POMDP
PulseAugur coverage of Dec-POMDP — every cluster mentioning Dec-POMDP across labs, papers, and developer communities, ranked by signal.
-
MARL research unifies observation and action delay for efficient learning
Researchers have formally established the structural equivalence between observation delay and action delay in cooperative partially observable multi-agent systems. They demonstrated that both systems produce identical …
-
New research shows high entropy leads to symmetry-equivariant policies in Dec-POMDPs
A new paper explores how high-entropy regularization can lead to symmetry-equivariant policies in Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs). The research demonstrates that sufficiently hi…
-
New C++ engine HASE achieves 33M steps/sec for multi-agent RL training
Researchers have developed a new C++ engine, Hide-And-Seek-Engine (HASE), designed to significantly improve the efficiency of training reinforcement learning agents in decentralized, partially observable environmen…