New ATOM-Bench benchmark tests robotic manipulation generalization

By PulseAugur Editorial · [2 sources] · 2026-06-15 15:08

Researchers have introduced ATOM-Bench, a new real-world benchmark designed to evaluate the atomic skills and compositional generalization capabilities of robotic manipulation policies. The benchmark includes 30 atomic tasks and 24 held-out compositional tasks, utilizing 3,000 human demonstrations for fine-tuning and evaluation. Initial tests on five representative policies revealed that while current models can grasp basic instruction-grounding, they struggle with fine-grained motor skills and reliably composing learned skills for novel tasks. AI

IMPACT This benchmark aims to improve the real-world generalization of robotic manipulation policies, addressing a key challenge in AI for robotics.

RANK_REASON The cluster describes a new academic benchmark and associated paper published on arXiv.

Read on arXiv cs.AI →

paper
other

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.AI TIER_1 English(EN) · Zenan Wu, Bingqing Wei, Lu Liu, Zheqi He, Xi Wang, Jiakang Liu, Zehui Li, Guocai Yao, Jing-Shu Zheng, Xi Yang, Yongtao Wang · 2026-06-16 04:00

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

arXiv:2606.16826v1 Announce Type: cross Abstract: Generalist manipulation policies are increasingly presented as foundation models for robotic control, but their real-world generalization remains difficult to diagnose. A policy may succeed on demonstrated tasks while still failin…
arXiv cs.AI TIER_1 English(EN) · Yongtao Wang · 2026-06-15 15:08

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

Generalist manipulation policies are increasingly presented as foundation models for robotic control, but their real-world generalization remains difficult to diagnose. A policy may succeed on demonstrated tasks while still failing to execute fine-grained atomic skills or recombi…

COVERAGE [2]

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies

RELATED ENTITIES

RELATED TOPICS