PulseAugur
EN
LIVE 20:42:24
中文(ZH) Om AI联汇发布VLX:全球首个面向物理世界的端侧流式多模态模型

Om AI unveils VLX, a streaming multimodal model for real-world AI

Om AI has launched VLX, a series of models designed for real-time interaction with the physical world. Unlike traditional models that process video frames offline, VLX uses a novel "streaming multimodal" architecture for continuous, millisecond-level perception and action. The series includes VLX-Flow for ongoing environmental awareness, VLX-Seek for precise spatial localization, and VLX-Go for direct translation of visual understanding into robotic actions. AI

IMPACT Enables real-time, continuous perception and action for edge devices, potentially accelerating embodied AI development.

RANK_REASON Frontier-lab model release with system card. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Om AI unveils VLX, a streaming multimodal model for real-world AI

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · 量子位的朋友们 ·

    Om AI Lianhui Releases VLX: The World's First Edge-Side Streaming Multimodal Model for the Physical World

    物理世界AI的下一步