Hugging Face has released two new Vision-Language-Action (VLA) models, SmolVLA and pi0, designed for robot control. SmolVLA is an efficient model trained on community data from Lerobot, while pi0 and pi0-FAST are presented as VLA models suitable for general robot control tasks. These releases aim to advance the capabilities of robots in understanding and acting upon visual and language instructions. AI
Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →
RANK_REASON Release of new, specialized models for robot control from a non-frontier lab, accompanied by blog posts detailing their training and capabilities.