Brief · PulseAugur

RESEARCH · arXiv cs.CL English(EN) · 1d · [4 sources]

Speech-Driven End-to-End Language Discrimination towards Chinese Dialects

Two new research papers propose advanced methods for distinguishing between Chinese dialects, a task traditionally challenging due to limited text data. One paper introduces a speech-driven approach using Mel Frequency Cepstral Coefficients (MFCC) and a CNN-HMM-DNN model to identify dialects from audio. The second paper focuses on low-resource scenarios, employing transfer learning and data augmentation techniques with a CDDTLDA framework and self-attention to improve Automatic Speech Recognition for dialect discrimination. AI

IMPACT These papers advance AI capabilities in fine-grained language analysis, potentially improving applications in speech recognition and natural language processing for diverse linguistic communities.

CDDTLDA
CNN
Mel Frequency Cepstral Coefficients
self-attention
varieties of Chinese
HMM-DNN
Chinese dialect corpora
arXiv