Mahjax: A GPU-Accelerated Mahjong Simulator for Reinforcement Learning in JAX
Researchers have developed Mahjax, a new GPU-accelerated simulator for the complex game of Riichi Mahjong, implemented in JAX. This tool is designed to facilitate reinforcement learning research, particularly for agents learning from scratch rather than relying on human play data. Mahjax achieves high throughput, processing up to 2 million steps per second on multiple GPUs, and has been validated for training agents to improve their performance. AI
IMPACT Enables large-scale reinforcement learning research for complex games, potentially leading to more general AI decision-making capabilities.