PulseAugur
实时 14:47:19

End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

Researchers have developed an end-to-end training pipeline for autoregressive image generation that jointly optimizes reconstruction and generation. This approach allows for direct supervision of the visual tokenizer from the generation results, differing from previous methods that trained tokenizers and generative models separately. The new model leverages vision foundation models to enhance 1D tokenizers and has achieved a state-of-the-art FID score of 1.48 on ImageNet 256x256 generation without guidance. AI

影响 Introduces a novel end-to-end training approach for image generation models, potentially improving efficiency and performance.

排序理由 Academic paper detailing a new method for autoregressive image generation.

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

报道来源 [2]

  1. arXiv cs.CV TIER_1 English(EN) · Wenda Chu, Bingliang Zhang, Jiaqi Han, Yizhuo Li, Linjie Yang, Yisong Yue, Qiushan Guo ·

    End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

    arXiv:2605.00503v1 Announce Type: new Abstract: Autoregressive image modeling relies on visual tokenizers to compress images into compact latent representations. We design an end-to-end training pipeline that jointly optimizes reconstruction and generation, enabling direct superv…

  2. arXiv cs.CV TIER_1 English(EN) · Qiushan Guo ·

    End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

    Autoregressive image modeling relies on visual tokenizers to compress images into compact latent representations. We design an end-to-end training pipeline that jointly optimizes reconstruction and generation, enabling direct supervision from generation results to the tokenizer. …