After removing VAE, SenseTime redefines the upper limit of open-source image generation with 8B parameters
SenseTime has released SenseNova U1, an 8B parameter open-source model that redefines image generation capabilities by removing the VAE component. This new architecture, called NEO-unify, enables end-to-end modeling of language and vision directly at the pixel level, eliminating information loss from compression. The model demonstrates state-of-the-art performance on various benchmarks, surpassing some closed-source models in its class, and is available under an Apache 2.0 license for commercial use. AI
IMPACT Sets a new benchmark for open-source image generation, potentially accelerating adoption of unified multimodal architectures.