PulseAugur

Stable Diffusion developer polls users on new VAE release

A developer is seeking community input on whether to release a new VAE (Variational Autoencoder) for Stable Diffusion models. The developer reports an LPIPS score (lower is better) of 0.490 with a 32-channel variant, is aiming for a theoretical limit of 0.40, and is gauging interest from tinkerers and fine-tuners. The community's response will influence whether the developer continues development or pivots to retraining for current SD models.

IMPACT This is a niche discussion among Stable Diffusion users about a potential VAE improvement, with minimal direct industry impact.

RANK_REASON This is a user poll about a potential technical release, not an actual release or significant research.


AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. r/StableDiffusion TIER_2 English(EN) · /u/lostinspaz

    At what quality would you be interested in a new vae for sd class models?

    Current VAE performance as rated by LPIPS scores:

    original sd vae: 1.2
    sdxl vae: 0.9
    qwen 2: 0.35
    flux2: 0.24

    Trouble with the last two is they do funky stuff making them completely incompatible with the early …