PulseAugur

Alibaba AI voice model ranks 5th globally, leads China on Speech Arena

Alibaba's new AI voice model, Fun-Realtime-TTS-Preview, placed fifth worldwide and first among Chinese models on Artificial Analysis's Speech Arena leaderboard on May 28, with an Elo score of 1190. Alibaba also ranked first in China in all three tracks: speech recognition (ASR), text-to-speech (TTS), and end-to-end voice understanding and conversation (Chat). Notably, Alibaba's ASR model also achieved the lowest word error rate in a separate evaluation, highlighting its accuracy in transcribing speech.

IMPACT Demonstrates advanced capabilities in voice AI, particularly for diverse languages and accents, potentially influencing future voice assistant development.

RANK_REASON Significant benchmark result for a major tech company's AI model, outperforming competitors.


AI-generated summary · Google Gemini · from 3 sources.


COVERAGE [3]

  1. 雷峰网 (Leiphone) TIER_1 中文(ZH) ·

    First in ASR, TTS, and Chat: Alibaba's large speech model scores a 'grand slam'

    On May 28, on the Speech Arena leaderboard of Artificial Analysis, a leading global AI evaluation platform, Alibaba's large speech model Fun-Realtime-TTS-Preview ranked fifth worldwide and first among Chinese models with an Elo score of 1190.

  2. 36氪 (36Kr) TIER_1 中文(ZH) ·

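    Alibaba's Large Speech Model Ranks First in China and Fifth Globally on Speech Arena

    36Kr has learned that on May 28, on the Speech Arena leaderboard of Artificial Analysis, a leading global AI evaluation platform, Alibaba's large speech model Fun-Realtime-TTS-Preview ranked fifth worldwide and first among Chinese models with an Elo score of 1190. In all three tracks, ASR (speech to text), Chat (end-to-end speech understanding and dialogue), and TTS (text to speech), it took first place in China.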

  3. SCMP — Tech TIER_1 English(EN) · Minxiao Chang ·

    Alibaba AI voice model cracks top 5 globally, outperforming US rivals in regional accents

    A new artificial intelligence voice model from Alibaba Group Holding has beaten out Western rivals OpenAI and xAI on a major global benchmark, underscoring its technical edge in capturing complex Chinese dialects and accents. Fun-Realtime-TTS-Preview, developed by Alibaba’s Tongy…