English(EN) The Perception-Physics Paradox: Probing Scientific Alignment with TC-Bench

新基准测试探究视觉基础模型科学推理能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-26 04:00

研究人员在视觉基础模型（VFMs）中发现了一个“感知-物理悖论”，即模型在视觉预测方面表现出色，但可能未能掌握潜在的物理原理。这是因为VFMs可能依赖于表面上的相关性而非结构不变性，从而在熟悉的情况下做出准确预测，但在分布外情况会失败。为了解决这个问题，开发了一个名为TC-Bench的新基准测试，用于热带气旋研究，旨在评估和改进这些模型的科学对齐。 AI

影响强调了AI模型需要推理物理原理，而不仅仅是视觉相关性，才能实现可靠的科学应用。

排序理由该集群包含一篇学术论文，介绍了用于评估AI模型的新基准测试和框架。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Dingling Yao, Andrea Polesello, Adeel Pervez, Caroline Muller, Francesco Locatello · 2026-05-26 04:00

The Perception-Physics Paradox: Probing Scientific Alignment with TC-Bench

arXiv:2605.24782v1 Announce Type: new Abstract: While Vision Foundation Models (VFMs) excel at predictive tasks on satellite imagery, their performance can arise from visual correlations rather than underlying structural invariants, making even perception-based out-of-distributio…

报道来源 [1]

The Perception-Physics Paradox: Probing Scientific Alignment with TC-Bench

相关实体

相关话题