PulseAugur
实时 11:33:01
English(EN) Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

新的RAG方法用于医学QA,结果喜忧参半,多模态方法在大规模上优于微调

研究人员开发了MED-VRAG,一个新颖的迭代多模态检索增强生成框架,该框架处理医学文档页面图像,包括表格和图形,而不仅仅是文本。该系统在四个医学QA基准测试中的平均准确率为78.6%,比基线高5.8个百分点,比MedRAG + GPT-4的比较高1.8个百分点。另外,一项在4B参数模型上比较领域微调与RAG在医学问答中的研究发现,微调带来了显著的6.8个百分点的准确率提升,而RAG未显示统计学上的显著改进。 AI

影响 新的多模态RAG技术在医学QA方面显示出潜力,而微调似乎比RAG在特定基准测试的小型模型上更有效。

排序理由 两篇不同的arXiv论文,提出了用于医学问答系统的新颖方法和比较分析。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 5 个来源。 我们如何撰写摘要 →

新的RAG方法用于医学QA,结果喜忧参半,多模态方法在大规模上优于微调

报道来源 [5]

  1. arXiv cs.AI TIER_1 English(EN) · Xupeng Chen, Binbin Shi, Chenqian Le, Jiaqi Zhang, Kewen Wang, Ran Gong, Jinhan Zhang, Chihang Wang ·

    Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

    arXiv:2604.27724v1 Announce Type: new Abstract: Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We prop…

  2. arXiv cs.AI TIER_1 English(EN) · Chihang Wang ·

    Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

    Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We propose MED-VRAG, an iterative multimodal RAG framew…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Iterative Multimodal Retrieval-Augmented Generation for Medical Question Answering

    Medical retrieval-augmented generation (RAG) systems typically operate on text chunks extracted from biomedical literature, discarding the rich visual content (tables, figures, structured layouts) of original document pages. We propose MED-VRAG, an iterative multimodal RAG framew…

  4. arXiv cs.CL TIER_1 English(EN) · Avi-ad Avraam Buskila ·

    Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

    arXiv:2604.23801v1 Announce Type: new Abstract: Practitioners deploying small open-weight large language models (LLMs) for medical question answering face a recurring design choice: invest in a domain-fine-tuned model, or keep a general-purpose model and inject domain knowledge a…

  5. arXiv cs.CL TIER_1 English(EN) · Avi-ad Avraam Buskila ·

    Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale

    Practitioners deploying small open-weight large language models (LLMs) for medical question answering face a recurring design choice: invest in a domain-fine-tuned model, or keep a general-purpose model and inject domain knowledge at inference time via retrieval-augmented generat…