Brief · PulseAugur

RESEARCH · arXiv cs.CL English(EN) · 3d · [2 sources]

Phonetic Error Analysis of Raw Waveform Acoustic Models

Researchers have analyzed the error patterns of raw waveform acoustic models used for phonetic recognition on the TIMIT dataset. They decomposed the phone error rate (PER) across phonetic categories and constructed confusion matrices to understand substitution errors. The study found that their models achieved state-of-the-art results for raw waveform systems on TIMIT, and transfer learning from WSJ further improved performance, particularly for consonants. AI

IMPACT This research offers a deeper understanding of phonetic error patterns, potentially leading to more accurate speech recognition systems.

CNN
WSJ
TIMIT
SincNet
Sinc2Net
Bidirectional LSTMs