PulseAugur
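Chinese (ZH) Deconstructing the full open-sourcing of Youdao's "Confucius 4": how does restructuring the chain of thought drive down deployment costs?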

Youdao open-sources Confucius 4 multimodal LLM, cuts costs

NetEase Youdao has announced a significant upgrade to its "Confucius 4" large language model, now entering the multimodal era with support for text, image, and audio interactions. The company is open-sourcing its core multimodal and text-to-speech (TTS) models, aiming to reduce implementation costs for developers. The new model demonstrates state-of-the-art performance in visual mathematical reasoning and offers a 43.2% reduction in reasoning chain output length, leading to lower inference costs.
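To make the cost claim concrete, here is a minimal sketch of how a 43.2% shorter reasoning chain would translate into output-token cost. Only the 43.2% figure comes from the summary; the token count, the price, and the `output_cost` helper are hypothetical, and the sketch assumes cost scales linearly with the number of output tokens.

```python
def output_cost(tokens: int, price_per_million: float) -> float:
    """Cost of generating `tokens` output tokens at a per-million-token price."""
    return tokens / 1_000_000 * price_per_million


REDUCTION = 0.432  # 43.2% shorter reasoning chain, as reported in the summary

# Hypothetical figures, chosen only for illustration.
baseline_tokens = 10_000   # reasoning-chain length before the change
price_per_million = 2.0    # assumed price per million output tokens

shortened_tokens = round(baseline_tokens * (1 - REDUCTION))
baseline_cost = output_cost(baseline_tokens, price_per_million)
shortened_cost = output_cost(shortened_tokens, price_per_million)

# Under linear pricing, the relative saving equals the length reduction.
savings = 1 - shortened_cost / baseline_cost

print(f"tokens: {baseline_tokens} -> {shortened_tokens}")
print(f"relative output-cost saving: {savings:.1%}")
```

The saving applies only to the output side of a request. Input-token cost is unchanged, so the reduction in a total bill would be smaller than 43.2% whenever prompts are long relative to the generated reasoning.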

Impact: Lowers barriers for developers in multimodal and speech synthesis, potentially accelerating AI Agent development and adoption.

Ranking rationale: This is a significant product release and open-source initiative from a major tech company in the AI space.

Read on 雷峰网 (Leiphone) →

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

  1. 雷峰网 (Leiphone) · TIER_1 · Chinese (ZH)

    Deconstructing the Full Open-Sourcing of Youdao's "Confucius 4": How Does Restructuring the Chain of Thought Drive Down Deployment Costs?

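    NetEase Youdao recently announced a comprehensive upgrade of its "Confucius" (子曰) large model to version 4.0. "Confucius 4" officially enters the omni-modal era: it fully supports combined interaction across text, images, and audio, and Youdao has also announced that it is open-sourcing its core "multimodal model" and "speech synthesis (TTS) model". At the same time, the translation model has undergone a deep technical overhaul that improves both translation quality and efficiency.

    Multimodal model reaches SOTA in vision and math-and-science reasoning, with industry-leading performance on text-only math-and-science problems

    According to Youdao, the open-sourced "Confucius 4" multimodal model, at a scale of 27B parameters and aimed at education scenarios, raises math-and-science capability with visual input to the industry's top level (SOTA). Among models of the same parameter scale, "Confucius 4", when handling difficult visual math-and-science problems such as math and physics questions that include charts…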