PulseAugur

Microsoft AI launches faster speech model with 2.4% WER

Microsoft AI has released MAI-Transcribe-1.5, a new speech recognition model that achieves a 2.4% word error rate (WER) across 43 languages. The model is built for speed: it runs up to five times faster than comparable models and can transcribe an hour of audio in under 15 seconds. MAI-Transcribe-1.5 has been integrated into Microsoft products including Copilot, Teams, and GitHub.
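For readers unfamiliar with the two headline figures, the sketch below shows how they are conventionally computed. It is illustrative only and is not Microsoft's evaluation code: the function names `word_error_rate` and `speedup_over_real_time` are hypothetical, the example sentences are made up, and real benchmarks normally normalize text (casing, punctuation) before scoring, which this sketch does not do.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = word-level edit distance / number of reference words.

    Assumption: words are split on whitespace with no text normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (one fewer than the current row) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def speedup_over_real_time(audio_seconds: float, processing_seconds: float) -> float:
    """How many seconds of audio are processed per second of compute."""
    return audio_seconds / processing_seconds


# One substituted word out of six reference words.
print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))

# One hour of audio in 15 seconds, the upper bound quoted in the summary.
print(speedup_over_real_time(3600, 15))
```

Under this definition, a 2.4% WER corresponds to roughly 2 to 3 word errors per 100 reference words, and transcribing an hour of audio in under 15 seconds means running at more than 240 times real time.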

IMPACT This release offers significant speed and accuracy improvements for speech-to-text, potentially enhancing user experiences in productivity tools.

RANK_REASON This is a significant product release from a major AI developer, featuring performance improvements and integration into key products.

Read on Mastodon — fosstodon.org →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. Mastodon (fosstodon.org) TIER_1 English(EN)

    Microsoft AI has unveiled MAI-Transcribe-1.5, a speech recognition model achieving 2.4% word error rate across 43 languages. The model runs up to 5x faster than comparable models and can transcribe an hour of audio in under 15 seconds. Integrated into Copilot, Teams and GitHub. h…