Youdao open-sources Confucius 4 multimodal LLM, cuts costs

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

NetEase Youdao has announced a significant upgrade to its "Confucius 4" large language model, now entering the multimodal era with support for text, image, and audio interactions. The company is open-sourcing its core multimodal and text-to-speech (TTS) models, aiming to reduce implementation costs for developers. The new model demonstrates state-of-the-art performance in visual mathematical reasoning and offers a 43.2% reduction in reasoning chain output length, leading to lower inference costs. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Lowers barriers for developers in multimodal and speech synthesis, potentially accelerating AI Agent development and adoption.

RANK_REASON This is a significant product release and open-source initiative from a major tech company in the AI space. [lever_c_demoted from significant: ic=1 ai=1.0]

Read on 雷峰网 (Leiphone) →

Youdao open-sources Confucius 4 multimodal LLM, cuts costs

COVERAGE [1]

雷峰网 (Leiphone) TIER_1 中文(ZH) · 2026-05-20 10:04

Deconstructing Youdao's 'Zhi Yue 4' Fully Open Source: How to Reduce Implementation Costs Through Refactored Chain-of-Thought?

<p>近日，网易有道宣布“子曰”大模型迎来 4.0 版本的全方位升级。“子曰4" 正式迈入全模态时代，不仅全面支持文本、图片、音频的融合交互，有道更宣布将核心的“多模态模型”与“语音合成（TTS）模型”正式开源。与此同时，翻译模型也迎来了深度的技术重构，翻译质量与效率实现双重提升。多模态模型视觉与数理斩获SOTA，纯文本数理难题性能行业领先据介绍，开源的“子曰4”多模态模型在 27B 参数规模上，面向教育场景，将支持视觉输入的数理能力拉到了行业顶尖水平（SOTA）。在同等参数规模的模型中，“子曰4”在处理带图表的数学题、物理题等高难度视觉数理问…

COVERAGE [1]

Deconstructing Youdao's 'Zhi Yue 4' Fully Open Source: How to Reduce Implementation Costs Through Refactored Chain-of-Thought?

RELATED ENTITIES

RELATED TOPICS