English(EN) What Does a Pathological Speech Assessment Model Know about Acoustic Features? A Case Study on Oral and Oropharyngeal Cancer Patients

Wav2Vec 2.0 模型可解释性用于病理性言语评估研究

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-23 07:37

研究人员调查了用于评估口腔癌和口咽癌患者病理性言语的 Wav2Vec 2.0 模型的可解释性。他们使用典型相关分析测量了模型嵌入与声学特征之间的相关性。研究发现，模型学习到的表征与频谱和韵律特征最相关，其中梅尔频率倒谱系数的第一阶在所有层中显示出最高的关联度。这项研究不仅有助于理解言语评估模型如何编码声学信息，还为在病理性言语分析中选择声学特征提供了实用见解。 AI

影响提供了对言语评估模型如何处理声学数据的见解，可能改进病理性言语分析。

排序理由学术论文，详细介绍了模型可解释性的案例研究。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.NE (Neural & Evolutionary) 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.NE (Neural & Evolutionary) TIER_1 English(EN) · Virginie Woisard · 2026-06-23 07:37

What Does a Pathological Speech Assessment Model Know about Acoustic Features? A Case Study on Oral and Oropharyngeal Cancer Patients

This work investigates the interpretability of a Wav2Vec 2.0based speech intelligibility assessment model for oral and oropharyngeal cancer patients through canonical correlation analysis. By measuring the correlation between the model embeddings and eGeMAPS low-level descriptors…

报道来源 [1]

What Does a Pathological Speech Assessment Model Know about Acoustic Features? A Case Study on Oral and Oropharyngeal Cancer Patients

相关实体

相关话题