Kwai has released Keye-VL-2.0-30B-A3B, a new 30-billion-parameter multimodal model designed for long-video understanding and agent capabilities. The model incorporates DSA attention, a novel technique aimed at improving its ability to process and interpret extended video content. The release positions Keye-VL-2.0-30B-A3B as a flagship model in the Keye series, with a focus on advancing multimodal AI applications.
AI IMPACT Introduces a new multimodal model with a focus on long-video understanding and agent capabilities.
RANK_REASON This is a release of a new model with technical details, but not from a top-tier frontier lab.
AI-generated summary · Google Gemini · from 1 source.