LoGeR model enables long-context geometric reconstruction with hybrid memory

作者 PulseAugur 编辑部 · [1 个来源] · 2026-04-28 04:00

Researchers have introduced LoGeR, a novel architecture designed for long-context geometric reconstruction in videos. This system addresses the limitations of existing feedforward models by processing video streams in chunks and employing a hybrid memory module. This module combines parametric Test-Time Training memory for global frame anchoring and a non-parametric Sliding Window Attention for precise alignment, enabling robust reconstruction over thousands of frames. AI

影响 Enables robust, globally consistent 3D reconstruction over unprecedented video horizons, potentially improving applications in robotics and autonomous systems.

排序理由 This is a research paper detailing a new model architecture for geometric reconstruction.

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Junyi Zhang, Charles Herrmann, Junhwa Hur, Chen Sun, Ming-Hsuan Yang, Forrester Cole, Trevor Darrell, Deqing Sun · 2026-04-28 04:00

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

arXiv:2603.03269v2 Announce Type: replace Abstract: Feedforward geometric foundation models achieve strong short-window reconstruction, yet scaling them to minutes-long videos is bottlenecked by quadratic attention complexity or limited effective memory in recurrent designs. We p…

报道来源 [1]

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

相关实体

相关话题