Ego-Pi: VLA Fine-Tuning for Ego-Centric Human and Robot Data
Researchers have developed Ego-Pi, a method for fine-tuning vision-language models (VLMs) using ego-centric data from both humans and robots. This approach addresses the data scarcity issue in robotics by leveraging readily available human data to train robots for new task semantics and skill composition. The findings indicate that human data significantly enhances robot learning capabilities, even in the absence of specific robot-collected data for novel tasks. AI
IMPACT Enables robots to learn new tasks and skills more efficiently by leveraging readily available human-centric data.