PulseAugur
实时 10:53:50
English(EN) VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking

VL-UniTrack 使用视觉语言提示实现无人机-地面统一目标跟踪

研究人员开发了 VL-UniTrack,一个用于同时从无人机和地面视角跟踪目标的创新框架。这种统一的方法将两个视角的特征编码到单个编码器中,克服了先前方法因孤立特征提取而存在的局限性。该框架包含一个视觉语言几何提示模块,用于将语言描述与视觉特征融合,增强跨视图交互并指导特定视图表示的学习。VL-UniTrack 还利用了置信度调制互蒸馏损失进行训练正则化,并在基准测试中展示了最先进的性能。 AI

影响 引入了一种使用视觉语言提示改进目标跟踪的新方法,可能增强监控和自主系统。

排序理由 这是一篇详细介绍新的视觉跟踪框架的研究论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

VL-UniTrack 使用视觉语言提示实现无人机-地面统一目标跟踪

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Boyue Xu, Ruichao Hou, Tongwei Ren, Gangshan Wu ·

    VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking

    arXiv:2605.04574v1 Announce Type: new Abstract: UAV-ground visual tracking (UGVT) aims to simultaneously track the same object from both the UAV and the ground view. However, existing two-stream methods suffer from isolated feature extraction and rely heavily on implicit appearan…

  2. arXiv cs.CV TIER_1 English(EN) · Gangshan Wu ·

    VL-UniTrack: A Unified Framework with Visual-Language Prompts for UAV-Ground Visual Tracking

    UAV-ground visual tracking (UGVT) aims to simultaneously track the same object from both the UAV and the ground view. However, existing two-stream methods suffer from isolated feature extraction and rely heavily on implicit appearance matching, which struggles to establish reliab…