English(EN) MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

新的MSEarth基准测试使用多模态大语言模型进行地球科学发现

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-06 04:00

研究人员开发了MSEarth，这是一个新的多模态基准测试，旨在评估多模态大语言模型（MLLMs）在地球科学推理方面的能力。该数据集包含超过289,000张带有详细说明和上下文讨论的图表，这些图表来自五大地球科学领域开放获取的科学出版物。MSEarth支持图表说明、选择题和开放式推理等任务，旨在为推进MLLMs在科学发现中的应用提供一个高保真度的资源。 AI

影响为多模态大语言模型在科学推理方面建立了一个新的基准测试，有可能加速人工智能在地球科学研究中的应用。

排序理由这是一篇介绍用于评估多模态大语言模型在地球科学领域能力的新基准数据集的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Xiangyu Zhao, Wanghan Xu, Bo Liu, Yuhao Zhou, Fenghua Ling, Ben Fei, Xiaoyu Yue, Lei Bai, Wenlong Zhang, Xiao-Ming Wu · 2026-05-06 04:00

MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

arXiv:2505.20740v3 Announce Type: replace Abstract: The rapid advancement of multimodal large language models (MLLMs) offers new opportunities for complex scientific challenges, yet their application in earth science-especially at the graduate level-remains underexplored due to a…

报道来源 [1]

MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

相关实体

相关话题