Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation

New Models Unify Speech and Singing Voice Generation

By the PulseAugur editorial team · [3 sources] · 2026-06-05 07:59

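Researchers have developed new unified models for generating human vocal audio, covering both speech and singing. UniVoice uses conditional flow matching and disentangles content, melody, and timbre, which allows independent control over speech prosody and singing melody. UniSinger, built on a multimodal diffusion Transformer, unifies speaker-cloned song generation and singing voice conversion with accompaniment. Both models demonstrate state-of-the-art performance on their respective tasks, opening new possibilities for audio generation and music production.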

Impact: These models advance the state of the art in unified audio generation and could affect music production and assistive tools.

Ranking rationale: Two academic papers introducing new audio generation models.


AI-generated summary · Google Gemini · from 3 sources.

Sources [3]

arXiv cs.AI · Ziyu Zhang, Chunyu Qiang, Xiaopeng Wang, Yuxin Guo, Kang Yin, Wenjie Tian, Jingbin Hu, Tianlun Zuo, Zhao Guo, Teng Ma, Yuzhe Liang, Chen Zhang, Lei Xie · 2026-06-08 04:00

Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation

arXiv:2606.07015v1. Abstract: While song generation and singing voice conversion (SVC) have evolved significantly, they have long been developed isolated: the former lacks zero-shot speaker cloning, while the latter overlooks vocal-accompaniment synergy. To br…

arXiv cs.AI · Junjie Zheng, Huixin Xue, Shihong Ren, Chaofan Ding, Hao Liu, Zihao Chen · 2026-06-06 04:00

UniVoice: A Unified Model for Speech and Singing Voice Generation

arXiv:2606.05852v1. Abstract: Text-to-speech (TTS) and singing voice synthesis (SVS) both aim to generate human vocal audio from symbolic inputs, but they impose different requirements on the generation process. Speech generation relies on flexible, language-d…

arXiv cs.AI · Lei Xie · 2026-06-05 07:59

Towards Unified Song Generation and Singing Voice Conversion with Accompaniment Co-Generation

While song generation and singing voice conversion (SVC) have evolved significantly, they have long been developed isolated: the former lacks zero-shot speaker cloning, while the latter overlooks vocal-accompaniment synergy. To bridge this gap, we propose UniSinger, the first end…
