PulseAugur

Youdao open-sources Confucius 4 multimodal LLM, cuts costs

NetEase Youdao has announced version 4.0 of its "Confucius 4" large language model, which now supports combined text, image, and audio interaction. The company is open-sourcing its core multimodal model and its text-to-speech (TTS) model, with the stated aim of reducing implementation costs for developers. According to the announcement, the new model reaches state-of-the-art performance in visual mathematical reasoning and shortens its reasoning-chain output by 43.2%, which lowers inference costs.
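To make the cost claim concrete, the sketch below shows how a 43.2% shorter reasoning chain would translate into output-token cost. It assumes that cost scales linearly with the number of generated tokens and that nothing else in the request changes. The baseline length and the per-token price are hypothetical placeholders, not figures from the announcement.

```python
def output_cost(tokens: int, price_per_million: float) -> float:
    """Cost of generating `tokens` output tokens at a per-million-token price."""
    return tokens / 1_000_000 * price_per_million


REDUCTION = 0.432  # from the summary: 43.2% shorter reasoning-chain output

# Hypothetical values, chosen only for illustration.
baseline_tokens = 10_000     # assumed reasoning-chain length before the change
price_per_million = 2.00     # assumed price per million output tokens

shortened_tokens = round(baseline_tokens * (1 - REDUCTION))
baseline_cost = output_cost(baseline_tokens, price_per_million)
shortened_cost = output_cost(shortened_tokens, price_per_million)

# Under linear pricing, the relative saving equals the token reduction.
saving = 1 - shortened_cost / baseline_cost

print(f"tokens: {baseline_tokens} -> {shortened_tokens}")
print(f"relative saving on reasoning output: {saving:.1%}")
```

Under these assumptions the saving on reasoning output matches the token reduction. The saving on a full request would be smaller, because input tokens and the final answer are unaffected.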

Summary written by gemini-2.5-flash-lite from 1 source.

IMPACT Lowers barriers for developers building multimodal and speech-synthesis applications, potentially accelerating AI agent development and adoption.

RANK_REASON This is a significant product release and open-source initiative from a major tech company in the AI space.


COVERAGE [1]

  1. 雷峰网 (Leiphone) TIER_1 中文(ZH) ·

    Deconstructing Youdao's Fully Open-Sourced "Confucius 4": How Does Restructuring the Chain of Thought Drive Down Deployment Costs?

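    NetEase Youdao recently announced a comprehensive upgrade of its "Confucius" (子曰) large model to version 4.0. "Confucius 4" formally enters the full-modality era: it supports combined interaction across text, images, and audio, and Youdao has also announced that it is open-sourcing its core multimodal model and its speech synthesis (TTS) model. The translation model has also undergone a deep technical restructuring that improves both translation quality and efficiency.

    Multimodal model reaches SOTA in vision and math-and-science tasks, with industry-leading performance on text-only math-and-science problems

    According to the announcement, the open-source "Confucius 4" multimodal model, at a 27B parameter scale and aimed at education scenarios, brings math-and-science ability with visual input to the industry's top level (SOTA). Among models of the same parameter scale, "Confucius 4", when handling difficult visual math-and-science problems such as math and physics questions with charts…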