Robots learn manipulation from human videos using keypoint tracking

By PulseAugur Editorial · [3 sources] · 2026-06-09 09:13

Researchers have developed a new framework called Dexterous Point Policy that learns robotic manipulation skills directly from human videos, eliminating the need for costly robot-specific demonstrations. The system utilizes a unified 3D keypoint representation of objects and hands to bridge the gap between human and robot actions. This approach achieved a 75.0% success rate on real-world tasks, significantly outperforming a state-of-the-art baseline which managed only 1.0% success. AI

IMPACT Enables robots to learn complex manipulation tasks from readily available human video data, reducing development costs and accelerating deployment.

RANK_REASON The cluster contains an academic paper detailing a new research framework.

Read on arXiv cs.LG →

paper
other

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

COVERAGE [3]

arXiv cs.AI TIER_1 English(EN) · Harsh Gupta, Guanya Shi, Wenzhen Yuan · 2026-06-11 04:00

LUCID: Learning Embodiment-Agnostic Intent Models from Unstructured Human Videos for Scalable Dexterous Robot Skill Acquisition

arXiv:2606.11628v1 Announce Type: cross Abstract: The most widely-adopted robot learning pipelines today learn skills from robot demonstrations or structured human data, which are expensive to collect and tied to specific embodiments. In contrast, unstructured human videos provid…
arXiv cs.LG TIER_1 English(EN) · Beomjun Kim, Seong Hyeon Park, Seunghoon Sim, Seungjun Moon, Sanghyeok Lee, Jinwoo Shin · 2026-06-10 04:00

Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations

arXiv:2606.10614v1 Announce Type: cross Abstract: Robotic foundation models pre-trained on human demonstration videos have shown promise, but a significant embodiment gap remains when the resulting policies are deployed on real robots. A common remedy is to fine-tune these models…
arXiv cs.CV TIER_1 English(EN) · Jinwoo Shin · 2026-06-09 09:13

Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations

Robotic foundation models pre-trained on human demonstration videos have shown promise, but a significant embodiment gap remains when the resulting policies are deployed on real robots. A common remedy is to fine-tune these models on robot-specific demonstrations. However, robot …

COVERAGE [3]

LUCID: Learning Embodiment-Agnostic Intent Models from Unstructured Human Videos for Scalable Dexterous Robot Skill Acquisition

Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations

Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations

RELATED ENTITIES

RELATED TOPICS