English(EN) Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence

Grounded Correspondence框架简化视频对象学习

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-05 11:29

研究人员引入了一个名为Grounded Correspondence的新框架，用于视频对象中心学习。该方法用确定性的二分匹配取代了传统学习的动力学模块，利用现有的自监督视觉骨干网络来保持时间一致性。该方法在时间建模方面不需要任何可学习的参数，并在多个基准测试中取得了有竞争力的结果。 AI

影响引入了一种新颖的、无参数的时间一致性方法用于视频对象学习，可能简化模型架构。

排序理由这是一篇发表在arXiv上的研究论文，详细介绍了视频对象中心学习的新框架。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.LG TIER_1 English(EN) · Joni Pajarinen · 2026-05-05 11:29

重新思考视频对象中心学习中的时间一致性：从预测到对应

The de facto approach in video object-centric learning maintains temporal consistency through learned dynamics modules that predict future object representations, called slots. We demonstrate that these predictors function as expensive approximations of discrete correspondence pr…
arXiv cs.CV TIER_1 English(EN) · Zhiyuan Li, Rongzhen Zhao, Wenyan Yang, Wenshuai Zhao, Pekka Marttinen, Joni Pajarinen · 2026-05-06 04:00

重新思考视频对象中心学习中的时间一致性：从预测到对应

arXiv:2605.03650v1 Announce Type: new Abstract: The de facto approach in video object-centric learning maintains temporal consistency through learned dynamics modules that predict future object representations, called slots. We demonstrate that these predictors function as expens…