NetEase Youdao has launched Confucius4-TTS, a new large model TTS engine that supports 14 languages. This engine is notable for its ability to clone voices with zero-shot learning, requiring only 3 seconds of audio and no reference text to replicate a speaker's tone and emotion. The model is fully open-source, with weights and tools available for local deployment, aiming to reduce costs and barriers for creators and developers in areas like digital humans and cross-lingual communication. AI
IMPACT Enables low-cost, high-quality voice cloning and cross-lingual synthesis, potentially accelerating adoption in digital content creation and global communication.
RANK_REASON Frontier-lab model release with system card [lever_c_demoted from frontier_release: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →