English(EN) Unifying Acoustic Features and Text with Multimodal LLMs for Neurodegenerative Screening

多模态大语言模型在神经退行性疾病分期方面取得进展

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-16 04:00

研究人员开发了 NeurMLLM，这是一种新颖的多模态大语言模型，用于阿尔茨海默病和帕金森病等神经退行性疾病的分期。该框架将语音中的声学特征、文本转录和人口统计数据整合到一个统一的序列中供大语言模型使用。通过采用视觉变换器对音频频谱图和梅尔频率倒谱系数进行编码，NeurMLLM 在 Bridge2AI-Voice 数据集上的性能优于传统的机器学习和现有的基于大语言模型的方法，展示了多模态大语言模型在提高疾病分期准确性和可及性方面的潜力。 AI

影响这项研究展示了多模态大语言模型在医疗筛查方面的新应用，有望提高神经退行性疾病的诊断准确性和可及性。

排序理由该集群包含一篇详细介绍新模型及其在特定数据集上评估的学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Qingfeng Zhang, Yuanxiong Guo, Yanmin Gong · 2026-06-16 04:00

Unifying Acoustic Features and Text with Multimodal LLMs for Neurodegenerative Screening

arXiv:2606.14788v1 Announce Type: cross Abstract: Voice-based screening offers a scalable and non-invasive way to assess neurodegenerative diseases such as Alzheimer's disease (AD) and Parkinson's disease (PD), but their staging remains challenging due to the difficulty of integr…

报道来源 [1]

Unifying Acoustic Features and Text with Multimodal LLMs for Neurodegenerative Screening

相关实体

相关话题