English(EN) SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

新基准SHOVIR旨在解决放射学AI中的视觉捷径学习问题

作者 PulseAugur 编辑部 · [4 个来源] · 2026-06-29 12:17

研究人员推出SHOVIR，这是一个旨在评估放射学报告生成（RRG）模型中视觉捷径学习的新基准。当前的RRG评估方法常常无法判断诊断陈述是否基于实际的视觉证据，导致模型利用虚假关联。SHOVIR通过使用带注释的数据集和遮挡实验来识别直接和上下文捷径，揭示了高性能模型可能仍然依赖肤浅的视觉证据。这项工作突显了RRG评估中的一个关键差距，并提倡使用区域感知评估协议。 AI

影响突显了当前医学影像AI评估中的一个关键差距，推动更强大、更基于视觉的评估。

排序理由该集群描述了一个新的基准和研究论文，用于评估特定领域中的AI模型。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。我们如何撰写摘要 →

报道来源 [4]

arXiv cs.AI TIER_1 English(EN) · Yucheng Chen, Jinjing Zhu, Yang Yu, Yufei Shi, Hane Naghshbandi, Jinhua Liu, Angela S. Koh, Fang Fen, Kian Eng Ong, Si Yong Yeo · 2026-07-01 04:00

Seeing Through Multiple Views: Parameter-Efficient Fine-Tuning via Selective Neurons for Consistent Radiology Report Generation

arXiv:2606.31099v1 Announce Type: cross Abstract: Recent years have seen substantial advances in radiology report generation (RRG), yet existing approaches predominantly adopt direct feature fusion when handling multi-view X-ray images. Such approaches overlook the potential clin…
arXiv cs.CL TIER_1 English(EN) · Filippo Ruffini, Marco Salm\'e, Rosa Sicilia, Valerio Guarrasi, Paolo Soda · 2026-06-30 04:00

SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

arXiv:2606.30201v1 Announce Type: cross Abstract: Current evaluation protocols for Vision-Language Models (VLMs) in Radiology Report Generation (RRG) rely on report-level metrics that measure lexical overlap or aggregate clinical correctness. However, such metrics do not test whe…
arXiv cs.CL TIER_1 English(EN) · Paolo Soda · 2026-06-29 12:17

SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

Current evaluation protocols for Vision-Language Models (VLMs) in Radiology Report Generation (RRG) rely on report-level metrics that measure lexical overlap or aggregate clinical correctness. However, such metrics do not test whether individual diagnostic statements stem from th…
arXiv cs.CV TIER_1 English(EN) · Miaojing Shi, Tianyu Cen, Zijie Yue, Meng Wei, Oluwatosin Alabi, Tom Vercauteren · 2026-06-30 04:00

Multimodal Large Language Model driven Radiology Report Generation with Clinical Knowledge Enhancement

arXiv:2403.06728v2 Announce Type: replace Abstract: Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. The performance of current RRG approaches remains unsatisfactory against clinical standards. This…

报道来源 [4]

Seeing Through Multiple Views: Parameter-Efficient Fine-Tuning via Selective Neurons for Consistent Radiology Report Generation

SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

Multimodal Large Language Model driven Radiology Report Generation with Clinical Knowledge Enhancement

相关实体

相关话题