New models enhance self-supervised depth estimation for autonomous driving

By PulseAugur Editorial · [3 sources] · 2026-06-30 11:07

Researchers have developed two new approaches to improve self-supervised monocular depth estimation for driving scenarios. FlexDepth introduces a flexible family of models with a scale-driven decoder and a two-stage training strategy to handle static and dynamic elements independently, achieving state-of-the-art performance with minimal computational overhead. DrivingDepth, on the other hand, focuses on correcting the geometry-scale conflict in depth estimation by using sparse LiDAR data as prompts to calibrate a frozen foundation model, preserving dense visual geometry while achieving superior metric accuracy and consistency. AI

IMPACT These advancements could lead to more reliable and efficient perception systems for autonomous vehicles, particularly in challenging driving conditions.

RANK_REASON The cluster contains two research papers detailing novel methods for self-supervised monocular depth estimation.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

New models enhance self-supervised depth estimation for autonomous driving

COVERAGE [3]

arXiv cs.CV TIER_1 English(EN) · Zhaowen Zhu, Li Zhang, Yujie Chen, Tian Zhang, Yingjie Wang, Mingxia Zhan · 2026-07-02 04:00

Towards Robust Driving Perception: A Flexible Scale-Driven Family for Self-Supervised Monocular Depth Estimation

arXiv:2607.00736v1 Announce Type: new Abstract: Self-Supervised Monocular Depth Estimation (MDE) has garnered attention in recent years due to its independence from ground truth. However, most existing models are limited to a single scale and exhibit considerable performance degr…
arXiv cs.CV TIER_1 English(EN) · Mingxia Zhan · 2026-07-01 10:18

Towards Robust Driving Perception: A Flexible Scale-Driven Family for Self-Supervised Monocular Depth Estimation

Self-Supervised Monocular Depth Estimation (MDE) has garnered attention in recent years due to its independence from ground truth. However, most existing models are limited to a single scale and exhibit considerable performance degradation in complex driving environments. Network…
arXiv cs.CV TIER_1 English(EN) · Liang Wang · 2026-06-30 11:07

DrivingDepth: Sparse-Prompted Pixel-wise Scale Correction for Driving Depth Estimation

Dense depth estimation for autonomous driving faces a geometry-scale conflict: depth foundation models deliver pixel-aligned dense visual geometry without reliable metric scale, while projected LiDAR provides metric anchors that are sparse, noisy, and misaligned with image struct…

COVERAGE [3]

Towards Robust Driving Perception: A Flexible Scale-Driven Family for Self-Supervised Monocular Depth Estimation

Towards Robust Driving Perception: A Flexible Scale-Driven Family for Self-Supervised Monocular Depth Estimation

DrivingDepth: Sparse-Prompted Pixel-wise Scale Correction for Driving Depth Estimation

RELATED ENTITIES

RELATED TOPICS