English(EN) PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

PixelU Transformer 提供高效的端到端像素扩散

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-26 06:39

研究人员推出 PixelU，这是一种新颖的 U 型扩散 Transformer，专为高效的端到端像素扩散而设计。该模型通过关注 $x$-预测范式而非 $v$-预测，挑战了像素空间扩散中复杂解码器的必要性。PixelU 利用零成本跳跃连接直接路由高频细节，并采用恒定通道空间下采样机制来分离低频语义。在 ImageNet 上的实验表明，PixelU 在计算成本显著降低的情况下，取得了与现有方法相比具有竞争力的 FID 分数。 AI

影响引入了一种计算效率更高的像素扩散模型方法，有可能加速生成图像合成领域的研究和开发。

排序理由该集群描述了一篇详细介绍新颖模型架构和技术的新学术论文。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CV TIER_1 English(EN) · Zipeng Guo, Lichen Ma, Yu He, Xiaolong Fu, Jingling Fu, Junshi Huang, Yan Li · 2026-06-29 04:00

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

arXiv:2606.27760v1 Announce Type: new Abstract: End-to-end pixel-space diffusion models bypass the lossy compression of Latent Diffusion Models (LDMs) but struggle to jointly model low-frequency semantics and high-frequency signals in high-dimensional space. Existing works heavil…
arXiv cs.CV TIER_1 English(EN) · Yan Li · 2026-06-26 06:39

PixelU：一种用于高效端到端像素扩散的U型Transformer

End-to-end pixel-space diffusion models bypass the lossy compression of Latent Diffusion Models (LDMs) but struggle to jointly model low-frequency semantics and high-frequency signals in high-dimensional space. Existing works heavily rely on complex pixel decoders to alleviate th…

报道来源 [2]

PixelU: A U-Shaped Transformer for Efficient End-to-End Pixel Diffusion

PixelU：一种用于高效端到端像素扩散的U型Transformer

相关实体

相关话题