NVIDIA's Jim Fan declared the end of Vision-Language-Action (VLA) models and teleoperation in robotics, advocating World Action Models (WAMs) as the new paradigm. Fan proposed that WAMs, inspired by Large Language Models (LLMs), will rely on next-state prediction followed by action fine-tuning for robot control. He emphasized a shift toward first-person human video as the primary training source, moving away from the limitations of teleoperation-based data collection.
Impact: This commentary signals a potential shift in robotics research and development toward new model architectures and data strategies.
Ranking rationale: This is commentary on the future of robotics by a prominent researcher, not a direct model release or product announcement.
- Andrej Karpathy
- DreamDojo
- Dream Zero
- EgoScale
- Elon Musk
- Jensen Huang
- Jim Fan
- LLM
- NVIDIA
- Robotics: Endgame
- Sunday
- Taylor Swift
- WAM
AI-generated summary · Google Gemini · from 1 source.