English(EN) Benchmarking the Alignment of Data-Quality Metrics, Human Judgment and Land-Cover Segmentation Performance for Earth Observation

AI数据质量指标与人类感知和任务性能不一致

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-25 04:00

一篇新发表在arXiv上的论文探讨了自动化数据质量指标与其在深度学习模型（尤其是在地球观测领域）中的实际效用之间的脱节。研究强调，像FID和LPIPS这样关注视觉保真度的常用指标，并不总是与人类感知或下游任务性能一致。研究发现，像旋转这样的扰动会显著改变指标得分，而不会影响人类识别；并且在自动化指标上得分较低的合成数据，在与真实数据一起使用时，可以提高下游性能。作者总结认为，在评估地理空间应用的合成数据集时，应优先考虑人类评估和任务特定性能，而不是纯粹的视觉保真度指标。 AI

影响强调了使用自动化指标评估合成数据质量时可能存在的陷阱，影响AI模型的训练和评估。

排序理由在arXiv上发表的学术论文，详细介绍了研究结果。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · \"Umit Mert \c{C}a\u{g}lar, Alptekin Temizel · 2026-06-25 04:00

Benchmarking the Alignment of Data-Quality Metrics, Human Judgment and Land-Cover Segmentation Performance for Earth Observation

arXiv:2606.25128v1 Announce Type: cross Abstract: Volume and quality of datasets are crucial for deep learning model training, yet they are often constrained by availability and data acquisition costs. Synthetic data augmentation can extend existing datasets with realistic images…

报道来源 [1]

Benchmarking the Alignment of Data-Quality Metrics, Human Judgment and Land-Cover Segmentation Performance for Earth Observation

相关实体

相关话题