Brief · PulseAugur

TOOL · arXiv cs.AI English(EN) · 7h

Unsupervised Partner Design Enables Robust Ad-hoc Teamwork

Researchers have developed a new multi-agent reinforcement learning method called Unsupervised Partner Design (UPD). This technique generates training partners dynamically during the learning process, adapting them based on a learnability criterion. UPD eliminates the need for pre-trained partner populations or manual tuning, leading to more diverse training and improved performance across various benchmarks like Level-Based Foraging and Overcooked-AI. Human-AI user studies indicated that agents trained with UPD were rated as more adaptive and less frustrating than baseline methods. AI

IMPACT This method could lead to more adaptable and human-like AI agents in collaborative tasks.

Overcooked-AI
Unsupervised Partner Design (UPD)
Overcooked Generalisation Challenge
Level-Based Foraging
Constantin Ruhdorfer