Alibaba's Qwen team has released Qwen3.5-Omni, a new generation of omnimodal large language models that process text, images, audio, and audio-visual content. The series includes models named Plus, Flash, and Light, all of which support a 256k context window and can handle over 10 hours of audio. The architecture uses a Hybrid-Attention Mixture-of-Experts (MoE) approach for both its reasoning and generation components.
AI IMPACT Expands LLM capabilities into native audio and video processing, potentially enabling more sophisticated AI agents and applications.
RANK_REASON Frontier-lab model release with system card.
AI-generated summary · Google Gemini · from 1 source.