A research paper details a system developed for the 2024 Text-Dependent Speaker Verification (TdSV) Challenge, achieving a Minimum Detection Cost Function of 0.0461 and an Equal Error Rate of 1.3%. The system adapted existing neural networks like ResNet-TDNN and NeXt-TDNN, originally trained on VoxCeleb, due to time and resource constraints. An EfficientNet-A0 model was also trained on the challenge dataset to enhance adaptation and bolster an ensemble approach, demonstrating the efficacy of multi-model ensembles for speaker and phrase verification. AI
IMPACT This research contributes to advancements in speaker verification technology, potentially improving security and user authentication systems.
RANK_REASON The cluster contains a research paper detailing a system's performance in a specific challenge. [lever_c_demoted from research: ic=1 ai=1.0]
- EfficientNet-A0
- NeXt-TDNN
- ResNet-TDNN
- Team Naive System
- Text-Dependent Speaker Verification (TdSV) Challenge 2024
- VoxCeleb
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →