PulseAugur
实时 13:12:29

New dataset and model boost medical dialogue for Indic languages

Researchers have developed IndicMedDialog, a new dataset designed to improve medical dialogue systems for Indic languages. This dataset includes parallel multi-turn conversations in English and nine Indic languages, created by augmenting existing medical dialogue data with LLM-generated synthetic consultations. The team also fine-tuned a small language model, IndicMedLM, using this dataset to enable personalized, multi-turn symptom elicitation with optional patient pre-context. AI

影响 Enhances accessibility of AI-powered healthcare tools for speakers of Indic languages.

排序理由 Publication of an academic paper detailing a new dataset and model for multilingual medical dialogue. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

New dataset and model boost medical dialogue for Indic languages

报道来源 [1]

  1. arXiv cs.CL TIER_1 English(EN) · Piyush Patel ·

    IndicMedDialog: A Parallel Multi-Turn Medical Dialogue Dataset for Accessible Healthcare in Indic Languages

    Most existing medical dialogue systems operate in a single-turn question--answering paradigm or rely on template-based datasets, limiting conversational realism and multilingual applicability. We introduce IndicMedDialog, a parallel multi-turn medical dialogue dataset spanning En…