Alibaba's Qwen team has released two new automatic speech recognition (ASR) models, Qwen3-ASR-1.7B-hf and Qwen3-ASR-0.6B-hf. Both models support 52 languages and dialects, with capabilities for offline and streaming inference. The larger 1.7B parameter model achieved a mean Word Error Rate (WER) of 5.59 on the Open ASR Leaderboard, while the smaller 0.6B model recorded a mean WER of 6.31. AI
IMPACT These models offer improved multilingual speech recognition capabilities, supporting both offline and streaming use cases.
RANK_REASON Release of new open-source models with benchmark results.
Read on Mastodon — mastodon.social →
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →