PulseAugur
实时 21:17:43
English(EN) SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

新基准SHOVIR旨在解决放射学AI中的视觉捷径学习问题

研究人员推出SHOVIR,这是一个旨在评估放射学报告生成(RRG)模型中视觉捷径学习的新基准。当前的RRG评估方法常常无法判断诊断陈述是否基于实际的视觉证据,导致模型利用虚假关联。SHOVIR通过使用带注释的数据集和遮挡实验来识别直接和上下文捷径,揭示了高性能模型可能仍然依赖肤浅的视觉证据。这项工作突显了RRG评估中的一个关键差距,并提倡使用区域感知评估协议。 AI

影响 突显了当前医学影像AI评估中的一个关键差距,推动更强大、更基于视觉的评估。

排序理由 该集群描述了一个新的基准和研究论文,用于评估特定领域中的AI模型。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 4 个来源。 我们如何撰写摘要 →

新基准SHOVIR旨在解决放射学AI中的视觉捷径学习问题

报道来源 [4]

  1. arXiv cs.AI TIER_1 English(EN) · Yucheng Chen, Jinjing Zhu, Yang Yu, Yufei Shi, Hane Naghshbandi, Jinhua Liu, Angela S. Koh, Fang Fen, Kian Eng Ong, Si Yong Yeo ·

    Seeing Through Multiple Views: Parameter-Efficient Fine-Tuning via Selective Neurons for Consistent Radiology Report Generation

    arXiv:2606.31099v1 Announce Type: cross Abstract: Recent years have seen substantial advances in radiology report generation (RRG), yet existing approaches predominantly adopt direct feature fusion when handling multi-view X-ray images. Such approaches overlook the potential clin…

  2. arXiv cs.CL TIER_1 English(EN) · Filippo Ruffini, Marco Salm\'e, Rosa Sicilia, Valerio Guarrasi, Paolo Soda ·

    SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

    arXiv:2606.30201v1 Announce Type: cross Abstract: Current evaluation protocols for Vision-Language Models (VLMs) in Radiology Report Generation (RRG) rely on report-level metrics that measure lexical overlap or aggregate clinical correctness. However, such metrics do not test whe…

  3. arXiv cs.CL TIER_1 English(EN) · Paolo Soda ·

    SHOVIR: A Benchmark for Evaluating Vision Shortcut Learning in Radiology Report Generation

    Current evaluation protocols for Vision-Language Models (VLMs) in Radiology Report Generation (RRG) rely on report-level metrics that measure lexical overlap or aggregate clinical correctness. However, such metrics do not test whether individual diagnostic statements stem from th…

  4. arXiv cs.CV TIER_1 English(EN) · Miaojing Shi, Tianyu Cen, Zijie Yue, Meng Wei, Oluwatosin Alabi, Tom Vercauteren ·

    Multimodal Large Language Model driven Radiology Report Generation with Clinical Knowledge Enhancement

    arXiv:2403.06728v2 Announce Type: replace Abstract: Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. The performance of current RRG approaches remains unsatisfactory against clinical standards. This…