PulseAugur
实时 15:23:54
English(EN) FusionRS: A Large-Scale RGB-Infrared Remote Sensing Dataset for Dual-Modal Vision-Language Foundation Models

新的FusionRS数据集增强了RGB-红外视觉语言模型

研究人员推出了FusionRS,这是一个新颖的大规模数据集,旨在推进遥感领域中双模态视觉语言基础模型的发展。该数据集独特地结合了RGB和红外图像及其对应的文本描述,解决了当前模型中红外数据探索不足的问题。使用FusionRS进行的实验表明,在RGB-IR对齐和图像描述等任务上的性能有所提高,突显了特定模态文本监督的价值。 AI

影响 该数据集通过整合热成像和视觉数据,有望实现更复杂的遥感分析,从而可能改进在环境监测和城市规划等领域的应用。

排序理由 该集群描述了一个在arXiv上发布的新数据集和相关的研究论文,arXiv是学术研究的常见场所。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Jiaju Han, Ben Zhang, Xuemeng Sun, Qike Zhang, Yuxian Dong, Chengyin Hu, Fengyu Zhang, Yiwei Wei, Jiujiang Guo ·

    FusionRS: A Large-Scale RGB-Infrared Remote Sensing Dataset for Dual-Modal Vision-Language Foundation Models

    arXiv:2606.17020v1 Announce Type: cross Abstract: Remote sensing vision-language models have advanced Earth observation understanding, but most existing work remains centered on RGB imagery, leaving the complementary information in infrared data underexplored. Infrared images pro…

  2. arXiv cs.CV TIER_1 English(EN) · Jiujiang Guo ·

    FusionRS: A Large-Scale RGB-Infrared Remote Sensing Dataset for Dual-Modal Vision-Language Foundation Models

    Remote sensing vision-language models have advanced Earth observation understanding, but most existing work remains centered on RGB imagery, leaving the complementary information in infrared data underexplored. Infrared images provide distinctive cues, including thermal intensity…