Researchers have developed a new method for compressing speech foundation models without requiring additional data or retraining. This approach utilizes channelwise clustering with k-means to achieve parameter compression, exploring mixed sparsity pruning by varying the number of clusters per layer. Experiments on LibriSpeech demonstrated significant word error rate (WER) reductions compared to magnitude-based pruning on models like HuBERT-large and Whisper-large-v3, even with substantial sparsity levels. AI
IMPACT This compression technique could enable more efficient deployment of large speech models on resource-constrained devices.
RANK_REASON The cluster contains an academic paper detailing a new research method for model compression.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →