English(EN) Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

新的RAG方法用于医学QA，结果喜忧参半，多模态方法在大规模上优于微调

作者 PulseAugur 编辑部 · [5 个来源] · 2026-04-26 16:49

研究人员开发了MED-VRAG，一个新颖的迭代多模态检索增强生成框架，该框架处理医学文档页面图像，包括表格和图形，而不仅仅是文本。该系统在四个医学QA基准测试中的平均准确率为78.6%，比基线高5.8个百分点，比MedRAG + GPT-4的比较高1.8个百分点。另外，一项在4B参数模型上比较领域微调与RAG在医学问答中的研究发现，微调带来了显著的6.8个百分点的准确率提升，而RAG未显示统计学上的显著改进。 AI

影响新的多模态RAG技术在医学QA方面显示出潜力，而微调似乎比RAG在特定基准测试的小型模型上更有效。

排序理由两篇不同的arXiv论文，提出了用于医学问答系统的新颖方法和比较分析。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。我们如何撰写摘要 →

报道来源 [5]

arXiv cs.AI TIER_1 English(EN) · Xupeng Chen, Binbin Shi, Chenqian Le, Jiaqi Zhang, Kewen Wang, Ran Gong, Jinhan Zhang, Chihang Wang · 2026-05-01 04:00

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

arXiv:2604.27724v1 Announce Type: new Abstract: Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We prop…
arXiv cs.AI TIER_1 English(EN) · Chihang Wang · 2026-04-30 11:16

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We propose MED-VRAG, an iterative multimodal RAG framew…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-04-30 11:16

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We propose MED-VRAG, an iterative multimodal RAG framew…
arXiv cs.CL TIER_1 English(EN) · Avi-ad Avraam Buskila · 2026-04-28 04:00

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

arXiv:2604.23801v1 Announce Type: new Abstract: Practitioners deploying small open-weight large language models (LLMs) for medical question answering face a recurring design choice: invest in a domain-fine-tuned model, or keep a general-purpose model and inject domain knowledge a…
arXiv cs.CL TIER_1 English(EN) · Avi-ad Avraam Buskila · 2026-04-26 16:49

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

Practitioners deploying small open-weight large language models (LLMs) for medical question answering face a recurring design choice: invest in a domain-fine-tuned model, or keep a general-purpose model and inject domain knowledge at inference time via retrieval-augmented generat…

报道来源 [5]

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

相关实体

相关话题