Brief · PulseAugur

TOOL · arXiv cs.LG English(EN) · 7h

Multi-View Speech Representation Learning for Parkinson's Disease Detection Using Context-guided Cross-modal Attention

Researchers have developed a novel multi-branch deep learning framework designed to improve the detection of Parkinson's disease through speech analysis. This approach utilizes three distinct speech representations: Log-Mel spectrograms, MFCCs, and HuBERT embeddings, each processed by specialized neural networks. A key innovation is a context-guided cross-modal attention mechanism that dynamically integrates these diverse features, leading to enhanced accuracy in identifying the disease. AI

IMPACT This research demonstrates a novel approach to using AI for early disease detection, potentially improving diagnostic accuracy and patient outcomes.

HuBERT
Parkinson's disease
BiLSTM
ResNet-18
Multi-View Speech Representation Learning for Parkinson's Disease Detection Using Context-guided Cross-modal Attention
PC-GITA corpus