PulseAugur
实时 11:41:50

MAPLE algorithm enhances AlphaZero for imperfect-information games

Researchers have developed a new tree search method called MAPLE, designed to improve the performance of AlphaZero-style algorithms in imperfect-information games. Unlike previous methods that struggle with strategy fusion or high computational costs, MAPLE aggregates policy and value evaluations from multiple sampled world states within a single search tree. Experiments on Phantom Go and Dark Hex demonstrated significant Elo improvements, showing MAPLE's effectiveness for AlphaZero-like learning in complex games. AI

影响 Introduces a novel approach for AI agents to master complex games with incomplete information.

排序理由 Academic paper detailing a new algorithm for game AI. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

报道来源 [1]

  1. arXiv cs.AI TIER_1 English(EN) · Qian-Rong Li, Hung Guei, I-Chen Wu, Ti-Rong Wu ·

    MAPLE: Multi-State Aggregated Policy Evaluation for AlphaZero in Imperfect-Information Games

    arXiv:2605.24139v1 Announce Type: new Abstract: Imperfect-information games (IIGs) are challenging, as players must make decisions without fully observing the true game state. While AlphaZero has achieved remarkable success in perfect-information games, extending it to IIGs remai…