New MSEarth benchmark uses MLLMs for Earth science discovery

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-06 04:00

Researchers have developed MSEarth, a new multimodal benchmark designed to evaluate the capabilities of multimodal large language models (MLLMs) in Earth science reasoning. This dataset comprises over 289,000 figures with detailed captions and contextual discussions, drawn from open-access scientific publications across the five major Earth science spheres. MSEarth supports tasks like figure captioning, multiple-choice questions, and open-ended reasoning, aiming to provide a high-fidelity resource for advancing MLLMs in scientific discovery. AI

影响 Establishes a new benchmark for MLLMs in scientific reasoning, potentially accelerating AI applications in Earth science research.

排序理由 This is a research paper introducing a new benchmark dataset for evaluating multimodal large language models in Earth science. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Xiangyu Zhao, Wanghan Xu, Bo Liu, Yuhao Zhou, Fenghua Ling, Ben Fei, Xiaoyu Yue, Lei Bai, Wenlong Zhang, Xiao-Ming Wu · 2026-05-06 04:00

MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

arXiv:2505.20740v3 Announce Type: replace Abstract: The rapid advancement of multimodal large language models (MLLMs) offers new opportunities for complex scientific challenges, yet their application in earth science-especially at the graduate level-remains underexplored due to a…

报道来源 [1]

MSEarth: A Multimodal Benchmark for Earth Science Phenomenon Discovery with MLLMs

相关实体

相关话题