PulseAugur
实时 00:58:01
English(EN) BenchX: Benchmarking AI Models for Cancer Detection and Localization with Demographic and Protocol Biases

新的BenchX基准揭示AI癌症检测模型在不同患者亚组中表现不佳

一项名为BenchX的新基准已被开发出来,用于评估用于癌症检测和定位的AI模型。该基准包含85,355张CT扫描图像,评估了12种AI模型在不同患者人口统计学和成像方案下的性能。研究结果表明,针对平均准确率优化的AI模型在代表性不足的亚组(如年轻、女性非洲裔美国人)上的表现往往不佳,这凸显了在医学AI中进行亚组级别评估的关键需求。 AI

影响 强调了在医学影像领域,尤其是在代表性不足的患者群体中,对更强大、更公平的AI模型的需求。

排序理由 该集群包含一篇详细介绍AI模型新基准的学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新的BenchX基准揭示AI癌症检测模型在不同患者亚组中表现不佳

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Qi Chen, Wenxuan Li, Pedro R. A. S. Bassi, Xinze Zhou, Jakob Wasserthal, Ibrahim Ethem Hamamci, Sezgin Er, Ashwin Kumar, Yiwen Ye, Yuhan Wang, Yuyin Zhou, Akshay S. Chaudhari, Curtis Langlotz, Kang Wang, Yang Yang, Alan L. Yuille, Zongwei Zhou ·

    BenchX:用于癌症检测和定位的 AI 模型基准测试,包含人口统计学和协议偏差

    arXiv:2606.24883v1 Announce Type: new Abstract: Artificial intelligence (AI) has achieved remarkable success in medical imaging, but it is widely recognized that these models often perform inconsistently across real-world clinical settings. Such inconsistencies occur when patient…

  2. arXiv cs.CV TIER_1 English(EN) · Zongwei Zhou ·

    BenchX:用于癌症检测和定位的 AI 模型基准测试,包含人口统计学和协议偏差

    Artificial intelligence (AI) has achieved remarkable success in medical imaging, but it is widely recognized that these models often perform inconsistently across real-world clinical settings. Such inconsistencies occur when patient demographics and imaging protocols vary, for ex…