Microsoft AI Introduces MAI-Transcribe-1.5: 2.4% WER on Artificial Analysis, Best-in-Class FLEURS Accuracy, and Up to 5x Faster Long-Audio Transcription
Microsoft AI has released MAI-Transcribe-1.5, an updated speech-to-text model supporting 43 languages and diverse acoustic conditions. The model achieves a 2.4% word error rate on the Artificial Analysis benchmark and boasts best-in-class accuracy on FLEURS. It also offers significant speed improvements, transcribing an hour of audio in under 15 seconds, and includes a keyword biasing feature to improve accuracy for domain-specific terms.
AI IMPACT: Enhances transcription accuracy and speed for enterprise applications, potentially improving efficiency in content creation and analysis.
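
For readers unfamiliar with the two headline numbers, the sketch below shows how a word error rate and a speed-up over real time are conventionally computed. It is an illustration only, not Microsoft's or Artificial Analysis's evaluation code: the function names `word_error_rate` and `real_time_factor` are hypothetical, and the sketch skips the text normalization (casing, punctuation) that benchmarks typically apply before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / words in the reference."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Word-level Levenshtein distance, keeping only one previous row.
    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(audio_seconds: float, processing_seconds: float) -> float:
    """How many times faster than real time the audio was processed."""
    return audio_seconds / processing_seconds


if __name__ == "__main__":
    # One substituted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on a mat"))
    # One hour of audio in 15 seconds, the upper bound quoted in the summary.
    print(real_time_factor(3600, 15))
```

Read this way, a 2.4% WER corresponds to roughly 24 word errors per 1,000 reference words, and an hour of audio in under 15 seconds means processing at more than 240 times real time.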