LiAuto-GeoX transformer enables real-time 3D driving scene understanding

By PulseAugur Editorial · [2 sources] · 2026-06-04 06:58

Researchers have developed LiAuto-GeoX, an efficient transformer model for real-time 3D scene understanding in autonomous driving. This model leverages sparse LiDAR data for geometric grounding and employs a novel distillation framework to create a compact 155M-parameter onboard version. LiAuto-GeoX achieves high-fidelity reconstruction at 220 FPS on the KITTI dataset and shows strong performance in downstream tasks like trajectory and occupancy prediction. AI

IMPACT Enables real-time, onboard 3D scene understanding for autonomous vehicles, potentially improving safety and efficiency.

RANK_REASON The cluster contains an academic paper detailing a new model and its performance on benchmarks.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

arXiv cs.CV TIER_1 English(EN) · Jiawei Lian, Haoyi Sun, Yang Wu, Lifu Mu, Siyuan Wang, Le Hui, Ning Mao, Tao Wei, Pan Zhou, Kun Zhan, Jian Yang · 2026-06-05 04:00

LiAuto-GeoX: Efficient Grounded Driving Transformer

arXiv:2606.05774v1 Announce Type: new Abstract: Dense 3D reconstruction has demonstrated immense potential for spatial understanding, yet its viability as a real-time, onboard representation for autonomous driving remains an open challenge. Existing large-scale visual geometry mo…
arXiv cs.CV TIER_1 English(EN) · Jian Yang · 2026-06-04 06:58

LiAuto-GeoX: Efficient Grounded Driving Transformer

Dense 3D reconstruction has demonstrated immense potential for spatial understanding, yet its viability as a real-time, onboard representation for autonomous driving remains an open challenge. Existing large-scale visual geometry models typically require substantial computational…

COVERAGE [2]

LiAuto-GeoX: Efficient Grounded Driving Transformer

LiAuto-GeoX: Efficient Grounded Driving Transformer

RELATED ENTITIES

RELATED TOPICS