PulseAugur
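中文(ZH) Accepted to ECCV 2026! Tsinghua's open-source spatial model beats Gemini: true spatial intelligence means learning continuously as the world changes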

Tsinghua University's Spatial-TTT model outperforms Gemini on spatial intelligence benchmarks

Researchers from Tsinghua University have developed Spatial-TTT, an open-source spatial intelligence model that has been accepted to ECCV 2026. The model continuously learns from long video streams and updates its spatial memory as it goes, outperforming models such as Gemini and GPT-5 on spatial intelligence benchmarks. Spatial-TTT combines three components: a hybrid architecture that uses fast weights as dynamic memory, a spatial prediction mechanism that helps it capture geometric relationships, and dense scene-description supervision that builds a comprehensive 3D understanding of environments.

IMPACT This research improves how multimodal AI understands and interacts with dynamic environments, which could accelerate applications in robotics and autonomous systems.

RANK_REASON The cluster details a new research paper and model release from a university, including benchmark results and comparisons to existing models.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · 思邈

    Accepted to ECCV 2026! Tsinghua's open-source spatial model beats Gemini: true spatial intelligence means learning continuously as the world changes

    Watches 120-minute videos and memorizes them as it goes