Speech separation study reveals SI-SDR limitations with noisy references

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-04 04:00

Researchers have investigated the effectiveness of the Scale-Invariant Signal-to-Distortion Ratio (SI-SDR) in speech separation when training data includes noisy references. Their analysis revealed that noise in references can limit achievable SI-SDR and introduce unwanted noise into separated outputs. To mitigate this, they proposed a method to enhance references and augment training data, which showed reduced noise but also potential for processing artifacts. AI

影响 Highlights limitations in current speech separation metrics, potentially guiding future research in audio AI.

排序理由 This is a research paper detailing a study on a specific metric within speech separation. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Simon Dahl Jepsen, Mads Gr{\ae}sb{\o}ll Christensen, Jesper Rindom Jensen · 2026-06-04 04:00

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References

arXiv:2508.14623v2 Announce Type: replace-cross Abstract: This paper examines the implications of using the Scale-Invariant Signal-to-Distortion Ratio (SI-SDR) as both evaluation and training objective in supervised speech separation, when the training references contain noise, a…

报道来源 [1]

A Study of the Scale Invariant Signal to Distortion Ratio in Speech Separation with Noisy References

相关话题