PulseAugur

PixelDiT diffusion transformer released with Qwen encoder support

A new diffusion transformer, PixelDiT, has been released. It has 1.3 billion parameters and operates directly in pixel space, without a VAE. According to the announcement, it needs only 4GB of VRAM and is now fully compatible with the Hugging Face Diffusers library, with support for the Qwen encoder.
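As a rough sanity check on the 4GB figure, the sketch below estimates how much memory the weights of a 1.3-billion-parameter model occupy at a few common precisions. The dtype list and the helper names (`weight_memory_gib`, `footprint_table`) are illustrative assumptions: the announcement does not say which precision the model ships in, and the estimate ignores activations and the text encoder.

```python
# Back-of-the-envelope weight memory for a 1.3B-parameter model.
# Assumption: the precision is NOT stated in the source; these dtypes are illustrative only.

NUM_PARAMS = 1.3e9  # parameter count quoted in the announcement

# Bytes used per parameter for some common storage formats.
DTYPE_BYTES = {
    "float32": 4,
    "float16/bfloat16": 2,
    "int8": 1,
}


def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GiB (1 GiB = 1024**3 bytes)."""
    return num_params * bytes_per_param / 1024**3


def footprint_table(num_params: float = NUM_PARAMS) -> dict:
    """Map each dtype name to its weight footprint in GiB, rounded to 2 decimals."""
    return {
        name: round(weight_memory_gib(num_params, nbytes), 2)
        for name, nbytes in DTYPE_BYTES.items()
    }


if __name__ == "__main__":
    for name, gib in footprint_table().items():
        print(f"{name}: {gib} GiB")
```

Under these assumptions, half-precision weights come to roughly 2.4 GiB and full-precision weights to roughly 4.8 GiB, so a 4GB budget would be plausible only at 16-bit precision or lower, before counting activations and the text encoder.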

IMPACT Adds a 1.3B-parameter, pixel-space diffusion model for image generation that is reported to run in 4GB of VRAM.

RANK_REASON Release of a new open-source model with technical specifications.


AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/StableDiffusion · TIER_2 · English (EN) · /u/madtune22

    PixelDiT — 1.3B pixel-space diffusion transformer, no VAE, 4GB VRAM, now 100% diffusers compatible with Qwen encoder support

    https://www.reddit.com/r/StableDiffusion/comments/1tuco68/pixeldit_13b_pixelspace_diffusion_transformer_no/