PulseAugur
LIVE 08:19:47
research · [2 sources] ·
0
research

End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

Researchers have developed an end-to-end training pipeline for autoregressive image generation that jointly optimizes reconstruction and generation. This approach allows for direct supervision of the visual tokenizer from the generation results, differing from previous methods that trained tokenizers and generative models separately. The new model leverages vision foundation models to enhance 1D tokenizers and has achieved a state-of-the-art FID score of 1.48 on ImageNet 256x256 generation without guidance. AI

Summary written by None from 2 sources. How we write summaries →

IMPACT Introduces a novel end-to-end training approach for image generation models, potentially improving efficiency and performance.

RANK_REASON Academic paper detailing a new method for autoregressive image generation.

Read on arXiv cs.CV →

COVERAGE [2]

  1. arXiv cs.CV TIER_1 · Wenda Chu, Bingliang Zhang, Jiaqi Han, Yizhuo Li, Linjie Yang, Yisong Yue, Qiushan Guo ·

    End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

    arXiv:2605.00503v1 Announce Type: new Abstract: Autoregressive image modeling relies on visual tokenizers to compress images into compact latent representations. We design an end-to-end training pipeline that jointly optimizes reconstruction and generation, enabling direct superv…

  2. arXiv cs.CV TIER_1 · Qiushan Guo ·

    End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer

    Autoregressive image modeling relies on visual tokenizers to compress images into compact latent representations. We design an end-to-end training pipeline that jointly optimizes reconstruction and generation, enabling direct supervision from generation results to the tokenizer. …