Researchers have introduced PubMed-Ophtha, a new dataset comprising over 100,000 ophthalmology image-caption pairs extracted from scientific literature. This dataset aims to address the scarcity of high-quality data for training vision-language models in the medical field. The extraction process involves detailed decomposition of figures from PDFs, annotation of imaging modalities, and sophisticated LLM-based caption splitting, achieving high accuracy in detection and extraction tasks. AI
影响 Provides a large-scale, annotated dataset to accelerate the development of specialized vision-language models in ophthalmology.
排序理由 The cluster describes the release of a new dataset and associated models for research purposes.
AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →