English(EN) CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

CIPER框架统一图像检索和姿态估计

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 04:00

研究人员开发了CIPER，一个统一地理定位的跨视图图像检索和姿态估计的新框架。与之前将这些视为独立任务的方法不同，CIPER的单一架构通过学习互惠特征来联合执行这两项任务。该系统利用共享的Transformer编码器和特定任务的token来区分检索和定位线索，并通过双向Transformer姿态解码器解决地面和航空影像之间的域差距。在基准数据集上的实验表明，该系统具有竞争力，尤其是在视场有限和任意方向等挑战性条件下。 AI

影响引入了一种统一的跨视图地理定位方法，有望提高自动驾驶和地图绘制等应用的准确性和效率。

排序理由详细介绍计算机视觉任务新框架的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CV TIER_1 English(EN) · Yurim Jeon, Dongseong Seo, Seung-Woo Seo · 2026-06-04 04:00

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

arXiv:2606.05011v1 Announce Type: new Abstract: Cross-view geo-localization estimates the geographic location of a ground image by matching it against an aerial image database. Existing methods tackle this through either large-scale retrieval or precise pose estimation, but not b…

报道来源 [1]

CIPER: A Unified Framework for Cross-view Image-retrieval and Pose-estimation

相关实体

相关话题