English(EN) Cross4D-JEPA: Dense Cross-modal Correspondence Distillation for 4D Point Cloud Representation Learning

新的 Cross4D-JEPA 方法蒸馏 2D 模型以理解 4D 点云

作者 PulseAugur 编辑部 · [2 个来源] · 2026-07-01 06:49

研究人员推出了一种新颖的自监督学习方法 Cross4D-JEPA，用于理解动态 4D 点云。该方法将来自 DINOv2 和 V-JEPA 2 等 2D 图像或视频基础模型的知识蒸馏到一个 4D 点编码器中。Cross4D-JEPA 利用密集的跨模态对应关系将 3D 点映射到教师块特征，训练学生编码器以匹配这些特征，而无需掩码、负样本或解码器。与单模态和全局跨模态基线相比，该方法在 MSR-Action3D 和 NTU RGB+D 60 等基准测试中表现出优越的性能，突显了其细粒度对应方法的有效性。 AI

影响增强了 4D 点云分析的自监督学习，可能改进机器人和具身感知。

排序理由介绍新方法及其评估的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Trung Thanh Nguyen, Hai Nguyen-Truong, Tu Vo, Hoang M. Truong, Tuan-Anh Vu · 2026-07-02 04:00

Cross4D-JEPA: Dense Cross-modal Correspondence Distillation for 4D Point Cloud Representation Learning

arXiv:2607.00514v1 Announce Type: cross Abstract: Automatic understanding of dynamic 4D point clouds, the 3D-point sequences captured over time by depth sensors and LiDAR, is central to robotics and embodied perception. Yet annotating them densely is expensive, making self-superv…
arXiv cs.AI TIER_1 English(EN) · Tuan-Anh Vu · 2026-07-01 06:49

Cross4D-JEPA：用于4D点云表示学习的密集跨模态对应蒸馏

Automatic understanding of dynamic 4D point clouds, the 3D-point sequences captured over time by depth sensors and LiDAR, is central to robotics and embodied perception. Yet annotating them densely is expensive, making self-supervised pretraining the natural route to transferable…

报道来源 [2]

Cross4D-JEPA: Dense Cross-modal Correspondence Distillation for 4D Point Cloud Representation Learning

Cross4D-JEPA：用于4D点云表示学习的密集跨模态对应蒸馏

相关实体

相关话题