Microsoft AI has released MAI-Transcribe-1.5, an updated speech-to-text model that supports 43 languages and a range of acoustic conditions. The model achieves a 2.4% word error rate on the Artificial Analysis benchmark and is reported to have best-in-class accuracy on FLEURS. It is also faster, transcribing an hour of audio in under 15 seconds, and adds a keyword biasing feature that improves accuracy on domain-specific terms.
AI IMPACT Improves transcription accuracy and speed for enterprise applications, which could make content creation and analysis more efficient.
RANK_REASON This is a product update for an existing speech-to-text model, not a novel frontier model release.
- Copilot
- GitHub
- MAI-Transcribe-1.5
- Microsoft AI
- Teams
- Artificial Analysis
- Dynamics 365 Contact Centre
- Foundry
- GPT-4o-Transcribe
- Scribe v2
AI-generated summary · Google Gemini · from 2 sources.