PulseAugur
EN
LIVE 14:54:14

CoMo method learns continuous latent motion from videos for robot learning

Researchers have developed CoMo, a novel method for learning continuous latent motion from internet videos to enhance robot learning. CoMo employs an early temporal difference mechanism to increase the difficulty of shortcut learning and explicitly strengthens motion cues. Additionally, a temporal contrastive learning scheme is used to ensure latent motion better captures meaningful foregrounds by constructing positive pairs with small future temporal offsets and negative pairs by reversing temporal direction. This approach allows CoMo to exhibit strong zero-shot generalization, generating effective pseudo action labels for unseen videos and leading to superior performance in policies co-trained with these labels. AI

IMPACT Enables more scalable and effective robot learning by leveraging vast internet video data for motion understanding.

RANK_REASON The item describes a new method presented in an arXiv paper for learning from videos for robot learning. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

CoMo method learns continuous latent motion from videos for robot learning

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Jiange Yang, Yansong Shi, Haoyi Zhu, Mingyu Liu, Kaijing Ma, Yating Wang, Gangshan Wu, Tong He, Limin Wang ·

    CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

    arXiv:2505.17006v3 Announce Type: replace Abstract: Unsupervised learning of latent motion from Internet videos is crucial for robot learning. Existing discrete methods generally mitigate the shortcut learning caused by extracting excessive static backgrounds through vector quant…