PulseAugur
实时 15:48:02
English(EN) Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence

新基准增强眼科视觉问答的临床可解释性

研究人员开发了FundusGround,一个用于眼科视觉问答(VQA)的新基准,该基准强调临床可解释性和证据定位。该基准包含超过10,000张眼底图像,并使用ETDRS网格对病灶进行精细标注和空间定位,以确保临床相关性。该数据集支持生成超过72,000个问题,实验表明,纳入病灶级别的证据可以提高医学VQA系统的模型准确性和透明度。 AI

影响 引入了一个用于训练和评估医学AI的新基准,有望提高眼科诊断的透明度和准确性。

排序理由 该集群描述了一篇介绍特定AI应用新基准和方法论的学术论文。

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.AI TIER_1 English(EN) · Jiang Liu ·

    Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence

    Visual Question Answering (VQA) holds great promise for clinical support, particularly in ophthalmology, where retinal fundus photography is essential for diagnosis. However, ophthalmic VQA benchmarks primarily emphasize answer accuracy, neglecting the explicit visual evidence ne…

  2. arXiv cs.CV TIER_1 English(EN) · Xingyue Wang, Bo Liu, Meng Wang, Zhixuan Zhang, Chengcheng Zhu, Huazhu Fu, Jiang Liu ·

    Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence

    arXiv:2605.22414v1 Announce Type: new Abstract: Visual Question Answering (VQA) holds great promise for clinical support, particularly in ophthalmology, where retinal fundus photography is essential for diagnosis. However, ophthalmic VQA benchmarks primarily emphasize answer accu…