中文(ZH) 去掉 VAE 之后，商汤用 8B 参数重新定义了开源生图的上限

SenseTime's 8B model redefines open-source image generation

By PulseAugur Editorial · [1 sources] · 2026-05-31 08:14

SenseTime has released SenseNova U1, an 8B parameter open-source model that redefines image generation capabilities by removing the VAE component. This new architecture, called NEO-unify, enables end-to-end modeling of language and vision directly at the pixel level, eliminating information loss from compression. The model demonstrates state-of-the-art performance on various benchmarks, surpassing some closed-source models in its class, and is available under an Apache 2.0 license for commercial use. AI

IMPACT Sets a new benchmark for open-source image generation, potentially accelerating adoption of unified multimodal architectures.

RANK_REASON New model release from a significant AI lab (SenseTime) with novel architecture. [lever_c_demoted from frontier_release: ic=1 ai=1.0]

Read on 雷峰网 (Leiphone) →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

SenseTime's 8B model redefines open-source image generation

COVERAGE [1]

雷峰网 (Leiphone) TIER_1 中文(ZH) · 2026-05-31 08:14

After removing VAE, SenseTime redefines the upper limit of open-source image generation with 8B parameters

<section style="text-align: left; margin: 0px 16px; line-height: 1.75em; display: block;"><span style="text-align: justify; line-height: 1.75em; font-size: 15px; letter-spacing: 0.5px; font-family: Arial, Helvetica, sans-serif;">雷峰网文章开源一周多，</span><span lang="EN-US" style="text-a…

COVERAGE [1]

After removing VAE, SenseTime redefines the upper limit of open-source image generation with 8B parameters

RELATED ENTITIES

RELATED TOPICS