Microsoft releases VibeVoice, an open-source speech-to-text AI model

By PulseAugur Editorial · [6 sources] · 2026-04-27 23:46

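Microsoft has released VibeVoice, an open-source speech-to-text model with built-in speaker diarization. The model is MIT licensed and can be deployed locally, so audio data does not need to be sent to an API. One user tested the model on a MacBook Pro and transcribed an hour of audio in under nine minutes, although the run required a large amount of memory.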
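As a rough illustration of what the reported test implies, the arithmetic below converts "one hour of audio in under nine minutes" into a speed relative to real time. It is a minimal sketch that assumes nine minutes as an upper bound on processing time; the variable names are illustrative and the result is a lower bound, not a benchmark.

```python
# Rough real-time-factor arithmetic for the reported MacBook Pro test.
# Assumption: "under nine minutes" is treated as an upper bound of 9 minutes.
audio_seconds = 60 * 60          # one hour of audio
max_processing_seconds = 9 * 60  # upper bound on the reported transcription time

# Speed relative to real time; the actual figure is at least this value.
min_speedup = audio_seconds / max_processing_seconds
print(f"at least {min_speedup:.1f}x faster than real time")
```

The summary gives no exact timing or memory figure, so this bound is all that can be derived from it.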

Impact: Provides a self-hostable, open-source alternative for speech-to-text transcription, which may lower operating costs for developers.

Ranking rationale: An open-source model released by a large company, but not a frontier model release from a top-tier AI lab.

Read at Simon Willison →

AI-generated summary · Google Gemini · from 6 sources.

Sources [6]

Simon Willison TIER_1 English(EN) · 2026-04-27 23:46

microsoft/VibeVoice

<a href="https://github.com/microsoft/VibeVoice">microsoft/VibeVoice</a> VibeVoice is Microsoft's Whisper-style audio model for speech-to-text, MIT licensed and with speaker diarization built into the model. Microsoft released it on January 21st, 20…
Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] · 2026-04-29 09:49

VibeVoice: Open-source frontier voice AI https://github.com/microsoft/VibeVoice #ai #github #microsoft #open-source

Link: github.com/…/VibeVoice
Mastodon — mastodon.social TIER_1 English(EN) · [email protected] · 2026-04-28 14:43

VibeVoice is a family of #opensource frontier #voiceAI models that includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models. https://github.com/microsoft/VibeVoice #AI #github #microsoft

Link: github.com/…/VibeVoice
Mastodon — mastodon.social TIER_1 English(EN) · Techino · 2026-04-28 13:00

🔓 OPEN SOURCE VibeVoice just went live — Microsoft's MIT-licensed speech-to-text model with built-in speaker diarization. Open-weight, no API calls, your audio never leaves your infra. If you're building call analytics, meeting tools, or any transcription pipeline, this cuts your…
Mastodon — mastodon.social TIER_1 English(EN) · CuratedHackerNews · 2026-04-28 12:39

Microsoft VibeVoice: Open-Source Frontier Voice AI https://github.com/microsoft/VibeVoice #ai #github #microsoft #open-source

Link: github.com/…/VibeVoice
Mastodon — mastodon.social TIER_1 English(EN) · ngate · 2026-04-28 12:36

Ah, #Microsoft, bravely charging into the open-source #AI frontier like a digital Don Quixote tilting at windmills of relevance. 🤖💡 Meanwhile, #GitHub users everywhere are left wondering if "VibeVoice" is the next big thing or just another buzzword salad pretending to be #in…

Link: github.com/…/VibeVoice
