NetEase Youdao has announced a significant upgrade to its "Confucius 4" large language model, which now supports text, image, and audio interactions. The company is open-sourcing its core multimodal and text-to-speech (TTS) models to reduce implementation costs for developers. The new model achieves state-of-the-art performance in visual mathematical reasoning and shortens its reasoning-chain output by 43.2%, which lowers inference costs.
Summary written by gemini-2.5-flash-lite from 1 source.
IMPACT Lowers barriers for developers working with multimodal models and speech synthesis, potentially accelerating AI Agent development and adoption.
RANK_REASON This is a significant product release and open-source initiative from a major tech company in the AI space.