PulseAugur
中文(ZH) The hottest direction at CVPR 2026, first brought on-device by a Hangzhou team!

Om AI unveils VLX: first on-device streaming multimodal model series

Om AI, a team from Hangzhou, has released VLX, a series of three end-to-end streaming multimodal models designed for real-world, on-device applications. The models, VLX-Flow, VLX-Seek, and VLX-Go, enable continuous perception, precise localization, and action decision-making, forming a closed-loop system for physical world interaction. Unlike traditional cloud-based models, VLX is engineered from the ground up for edge devices like phones, drones, and robots, prioritizing efficiency and real-time responsiveness.

IMPACT Enables more capable and responsive AI agents on edge devices, potentially accelerating robotics and embodied AI development.

RANK_REASON New multimodal model series released by a research team, focusing on novel on-device streaming capabilities.


AI-generated summary · Google Gemini · from 1 source.


COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · henry

    The hottest direction at CVPR 2026, pioneered on the edge by a Hangzhou team!

    Striking again after VLM-R1! The world's first on-device streaming multimodal model is here!