FUSER transformer achieves efficient multiview 3D registration with diffusion refinement

By PulseAugur Editorial · [1 sources] · 2026-04-28 04:00

Researchers have introduced FUSER, a novel feed-forward transformer designed for multiview 3D registration. This model processes all scans simultaneously in a latent space to directly predict global poses, bypassing the need for computationally intensive pairwise matching. FUSER utilizes a sparse 3D CNN to encode scans and a Geometric Alternating Attention module for efficient reasoning, incorporating 2D attention priors from foundation models. Additionally, FUSER-DF, a diffusion refinement framework, further enhances registration accuracy by denoising estimates in the joint SE(3)$^N$ space. AI

IMPACT Introduces a novel transformer architecture for 3D registration, potentially improving efficiency and accuracy in applications like robotics and augmented reality.

RANK_REASON This is a research paper introducing a new method for 3D registration.

Read on arXiv cs.CV →

paper
other

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

arXiv cs.CV TIER_1 English(EN) · Haobo Jiang, Jin Xie, Jian Yang, Liang Yu, Jianmin Zheng · 2026-04-28 04:00

FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement

arXiv:2512.09373v2 Announce Type: replace Abstract: Registration of multiview point clouds conventionally relies on extensive pairwise matching to build a pose graph for global synchronization, which is computationally expensive and inherently ill-posed without holistic geometric…

COVERAGE [1]

FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)$^N$ Diffusion Refinement

RELATED ENTITIES

RELATED TOPICS