Hanabi
PulseAugur coverage of Hanabi — every cluster mentioning Hanabi across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
MARL benchmarks may not require complex reasoning, study finds
A new research paper published on arXiv questions the effectiveness of current benchmarks in cooperative multi-agent reinforcement learning (MARL). The study introduces diagnostic tools to assess whether agents truly em…
-
New research shows high entropy leads to symmetry equivariant policies in Dec-POMDPs
A new paper explores how high entropy regularization can lead to symmetry-equivariant policies in Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs). The research demonstrates that sufficiently hi…
-
LLM reasoning improved by graph integration, not just graph reading
Researchers explored how explicit belief graphs impact Large Language Model (LLM) performance in cooperative multi-agent reasoning tasks, specifically the card game Hanabi. Their findings indicate that the integration a…