Researchers have developed QC-GAN, a new parameter-efficient framework for speech enhancement that combines a Quaternion Conformer generator with MetricGAN-based training. This approach utilizes the Hamilton product to encode magnitude and phase, significantly reducing parameters while maintaining interdependencies. A metric-learning discriminator optimizes perceptual quality, achieving a PESQ score of 3.48 with only 0.89M parameters on the VoiceBank+DEMAND dataset, and a smaller variant with 35K parameters also showed strong performance. The model demonstrated generalization capabilities on the DNS-Challenge 3 dataset. AI
IMPACT This research introduces a more parameter-efficient approach to speech enhancement, potentially enabling higher-quality audio processing on devices with limited computational resources.
RANK_REASON The cluster contains an academic paper detailing a new model and methodology for speech enhancement.
- arXiv
- DNS-Challenge 3
- Hamilton product
- MetricGAN
- Pesquet
- QC-GAN
- Quaternion Conformer
- VoiceBank+DEMAND
- Hamilton
- Quaternion Conformer GAN
AI-generated summary · Google Gemini · from 3 sources. How we write summaries →