PulseAugur
EN
LIVE 03:21:51
中文(ZH) 24小时直播,只靠一张照片?虎牙实时多模态数字人VAM 1.0率先突围行业三堵墙

Huya launches VAM 1.0, a 24-hour real-time interactive AI digital human

Huya has launched VAM 1.0, a real-time multimodal digital human model that can engage in conversations, sing, dance, and play games, all from a single photo input. The model is built on a DiT architecture and can operate continuously for over 24 hours, outputting at a resolution of 480x832 at 28 frames per second. Unlike previous AI digital humans that often felt like pre-recorded videos, VAM 1.0 offers genuine real-time interaction, including the ability to handle interruptions, adapt to user preferences for address, and maintain conversational flow. The technology addresses key industry challenges such as temporal stability, interactive capabilities, and computational efficiency, aiming to enhance applications in live streaming, e-commerce, and news broadcasting. AI

IMPACT Sets a new benchmark for real-time interactive AI digital humans, potentially accelerating adoption in live streaming and virtual content creation.

RANK_REASON Product release from a significant AI lab (Huya) with a new model version (VAM 1.0). [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 量子位 (QbitAI) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Huya launches VAM 1.0, a 24-hour real-time interactive AI digital human

COVERAGE [1]

  1. 量子位 (QbitAI) TIER_1 中文(ZH) · 一水 ·

    24-hour live broadcast, relying only on a photo? Huya's real-time multimodal digital human VAM 1.0 takes the lead in breaking through three industry walls

    能聊、能唱跳、能陪你玩游戏