PulseAugur
LIVE 08:24:55
research · [6 sources] ·
0
research

Microsoft releases VibeVoice, an open-source speech-to-text AI model

Microsoft has released VibeVoice, an open-source speech-to-text model with built-in speaker diarization. The MIT-licensed model is available for local deployment, meaning audio data does not need to be sent to an API. One user tested the model on a MacBook Pro, transcribing an hour of audio in under nine minutes, though it required significant RAM. AI

Summary written by gemini-2.5-flash-lite from 6 sources. How we write summaries →

IMPACT Provides a self-hostable, open-source alternative for speech-to-text transcription, potentially reducing operational costs for developers.

RANK_REASON Open-source model release from a major company, but not a frontier model release from a top-tier AI lab.

Read on Simon Willison →

Microsoft releases VibeVoice, an open-source speech-to-text AI model

COVERAGE [6]

  1. Simon Willison TIER_1 ·

    microsoft/VibeVoice

    <p><strong><a href="https://github.com/microsoft/VibeVoice">microsoft/VibeVoice</a></strong></p> VibeVoice is Microsoft's Whisper-style audio model for speech-to-text, MIT licensed and with speaker diarization built into the model.</p> <p>Microsoft released it on January 21st, 20…

  2. Mastodon — fosstodon.org TIER_1 · [email protected] ·

    VibeVoice: Open-source frontier voice AI https:// github.com/microsoft/VibeVoice # ai # github # microsoft # open -source

    VibeVoice: Open-source frontier voice AI https:// github.com/microsoft/VibeVoice # ai # github # microsoft # open -source

  3. Mastodon — mastodon.social TIER_1 · [email protected] ·

    VibeVoice is a family of # opensource frontier # voiceAI models that includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models. https://

    VibeVoice is a family of # opensource frontier # voiceAI models that includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models. https:// github.com/microsoft/VibeVoice # AI # github # microsoft

  4. Mastodon — mastodon.social TIER_1 · Techino ·

    🔓 OPEN SOURCE VibeVoice just went live — Microsoft's MIT-licensed speech-to-text model with built-in speaker diarization. Open-weight, no API calls, your audio

    🔓 OPEN SOURCE VibeVoice just went live — Microsoft's MIT-licensed speech-to-text model with built-in speaker diarization. Open-weight, no API calls, your audio never leaves your infra. If you're building call analytics, meeting tools, or any transcription pipeline, this cuts your…

  5. Mastodon — mastodon.social TIER_1 · CuratedHackerNews ·

    Microsoft VibeVoice: Open-Source Frontier Voice AI https:// github.com/microsoft/VibeVoice # ai # github # microsoft # open -source

    Microsoft VibeVoice: Open-Source Frontier Voice AI https:// github.com/microsoft/VibeVoice # ai # github # microsoft # open -source

  6. Mastodon — mastodon.social TIER_1 · ngate ·

    Ah, # Microsoft , bravely charging into the open-source # AI frontier like a digital Don Quixote tilting at windmills of relevance. 🤖💡 Meanwhile, # GitHub users

    Ah, # Microsoft , bravely charging into the open-source # AI frontier like a digital Don Quixote tilting at windmills of relevance. 🤖💡 Meanwhile, # GitHub users everywhere are left wondering if "VibeVoice" is the next big thing or just another buzzword salad pretending to be # in…