Moonshot AI open-sources Kimi-Audio-7B for audio tasks

By the PulseAugur editorial team · [1 source] · 2026-05-12 09:19

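Moonshot AI has released Kimi-Audio-7B, an open-source foundation model for audio tasks. The model handles audio understanding, generation, and conversation. It was trained on more than 13 million hours of data and is reported to achieve state-of-the-art performance on several benchmarks, including LibriSpeech and VoiceBench. The release includes inference code, fine-tuning examples, and an evaluation toolkit.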

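Impact: Provides a new open-source foundation model for audio processing, which may accelerate research and development in speech technology.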

Ranking rationale: An open-source release of a new audio foundation model, accompanied by benchmark results.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

Mastodon (sigmoid.social) · English · 2026-05-12 09:19

Moonshot AI open-sources Kimi-Audio-7B: a unified foundation model for audio understanding, generation, and conversation. Trained on 13M+ hours of data, achieves SOTA results on LibriSpeech, AISHELL, and VoiceBench. Includes inference code, fine-tuning examples, and evaluation to…

Link: github.com/…/Kimi-Audio
