PixelU Transformer offers efficient end-to-end pixel diffusion

By PulseAugur Editorial · [2 sources] · 2026-06-26 06:39

Researchers have introduced PixelU, a novel U-shaped Diffusion Transformer designed for efficient end-to-end pixel diffusion. This model challenges the necessity of complex decoders in pixel-space diffusion by focusing on the $x$-prediction paradigm rather than $v$-prediction. PixelU utilizes zero-cost skip connections for direct routing of high-frequency details and a constant-channel spatial down-sampling mechanism to isolate low-frequency semantics. Experiments on ImageNet show PixelU achieving competitive FID scores with significantly reduced computational cost compared to existing methods. AI

IMPACT Introduces a more computationally efficient approach to pixel diffusion models, potentially accelerating research and development in generative image synthesis.

RANK_REASON The cluster describes a new academic paper detailing a novel model architecture and technique.

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

PixelU Transformer offers efficient end-to-end pixel diffusion

COVERAGE [2]

arXiv cs.CV TIER_1 English(EN) · Zipeng Guo, Lichen Ma, Yu He, Xiaolong Fu, Jingling Fu, Junshi Huang, Yan Li · 2026-06-29 04:00

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

arXiv:2606.27760v1 Announce Type: new Abstract: End-to-end pixel-space diffusion models bypass the lossy compression of Latent Diffusion Models (LDMs) but struggle to jointly model low-frequency semantics and high-frequency signals in high-dimensional space. Existing works heavil…
arXiv cs.CV TIER_1 English(EN) · Yan Li · 2026-06-26 06:39

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

End-to-end pixel-space diffusion models bypass the lossy compression of Latent Diffusion Models (LDMs) but struggle to jointly model low-frequency semantics and high-frequency signals in high-dimensional space. Existing works heavily rely on complex pixel decoders to alleviate th…

COVERAGE [2]

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

RELATED ENTITIES

RELATED TOPICS