English(EN) Revisiting Performance Claims for Chest X-Ray Models Using Clinical Context

研究发现胸部X光AI模型可能夸大临床效用

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-29 04:00

一篇新发表在arXiv上的研究论文表明，当前用于诊断胸部X光的机器学习模型可能夸大了其在现实世界中的临床效用。该研究纳入了患者出院小结等临床背景信息，发现模型性能（以AUROC和其他指标衡量）在已有较高患病概率的患者身上显著下降。这表明这些模型在高风险患者群体中可能表现更差，突显了报告的平均性能与实际临床适用性之间的差距。 AI

影响强调了AI诊断工具在真实临床环境中，尤其是在高风险患者方面，可能存在的性能高估问题。

排序理由一篇发表在arXiv上的研究论文，详细介绍了一种用于评估ML模型的新方法。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.LG TIER_1 English(EN) · Andrew Wang, Jiashuo Zhang, Michael Oberst · 2026-06-29 04:00

Revisiting Performance Claims for Chest X-Ray Models Using Clinical Context

arXiv:2509.19671v3 Announce Type: replace Abstract: Public datasets of Chest X-Rays (CXRs) have long been a popular benchmark for developing machine learning (ML) computer vision models in healthcare. However, the reported strong average-case performance of these models do not ne…

报道来源 [1]

Revisiting Performance Claims for Chest X-Ray Models Using Clinical Context

相关实体

相关话题