Towards Clinically Interpretable Ophthalmic VQA via Spatially-Grounded Lesion Evidence
Researchers have developed FundusGround, a new benchmark for ophthalmic visual question answering (VQA) that emphasizes clinical interpretability and evidence grounding. The benchmark includes over 10,000 fundus images with meticulously annotated lesions, spatially localized using the ETDRS grid for clinical relevance. This dataset supports the generation of over 72,000 questions, and experiments show that incorporating lesion-level evidence improves both model accuracy and transparency in medical VQA systems. AI
IMPACT Introduces a new benchmark for training and evaluating medical AI, potentially improving diagnostic transparency and accuracy in ophthalmology.