Researchers have developed a novel self-improving planning framework called WA* that combines a value heuristic represented by a Relational Graph Neural Network with Q-learning. This approach guides search and uses the resulting data to update the heuristic, enabling it to function as a general policy. The framework demonstrates strong zero-shot generalization capabilities, solving new problem instances without search, which is a significant advancement over traditional Deep Reinforcement Learning methods in sparse-reward domains. The system has shown success on benchmarks like Sokoban, PushWorld, The Witness, and the 2023 International Planning Competition. AI
IMPACT Achieves strong zero-shot generalization in planning tasks, potentially overcoming limitations of current DRL methods.
RANK_REASON The cluster contains an academic paper detailing a new AI research framework and its performance on benchmarks.
- 2023 International Planning Competition
- Blocksworld
- Deep Reinforcement Learning
- PushWorld
- Q-learning
- Sokoban
- The Witness
- WA*
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →