Stitched Value Model for Diffusion Alignment
Researchers have developed StitchVM, a framework for aligning diffusion models with target rewards such as prompt fidelity. The method transfers reward models trained on clean images to the noisy intermediate latents that arise during the diffusion process. By stitching a pretrained pixel-space reward model onto a frozen diffusion backbone, StitchVM produces a lightweight value function that operates directly on noisy latents. The approach speeds up downstream alignment methods such as DPS and DiffusionNFT while also reducing their memory requirements.
AI IMPACT: Makes diffusion model alignment with methods such as DPS and DiffusionNFT faster and less memory-intensive.
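
The stitching idea described above can be illustrated with a toy sketch. This is not the StitchVM implementation: the summary gives no architectural details, so every name, dimension, and design choice below is an assumption made for illustration. A fixed random projection stands in for the frozen diffusion backbone, a fixed linear head stands in for the tail of the pretrained reward model, and only a small stitch layer between them is trained so that the value predicted from a noisy latent matches the reward of the corresponding clean sample.

```python
import numpy as np

rng = np.random.default_rng(0)
LATENT_DIM, FEAT_DIM, REWARD_DIM = 16, 32, 8  # hypothetical sizes

# Frozen stand-in for the diffusion backbone (assumption: fixed random projection).
W_backbone = rng.normal(size=(FEAT_DIM, LATENT_DIM + 1)) / np.sqrt(LATENT_DIM + 1)
# Frozen stand-in for the tail of the pretrained reward model (assumption: linear head).
w_reward = rng.normal(size=REWARD_DIM)
w_reward /= np.linalg.norm(w_reward)


def backbone_features(x_t, t):
    """Frozen features for noisy latents x_t (batch, LATENT_DIM) at noise levels t (batch,)."""
    inp = np.concatenate([x_t, t[:, None]], axis=1)
    return np.tanh(inp @ W_backbone.T)


def value(stitch, x_t, t):
    """Stitched value estimate: backbone features -> stitch layer -> reward tail."""
    return backbone_features(x_t, t) @ stitch.T @ w_reward


def train_stitch(x_t, t, target_reward, steps=500, lr=0.02):
    """Fit only the stitch layer by gradient descent on mean squared error."""
    stitch = np.zeros((REWARD_DIM, FEAT_DIM))
    feats = backbone_features(x_t, t)  # backbone is frozen, so compute features once
    for _ in range(steps):
        err = feats @ stitch.T @ w_reward - target_reward
        # Gradient of mean(err**2) with respect to the stitch matrix.
        grad = 2.0 * np.outer(w_reward, (err[:, None] * feats).mean(axis=0))
        stitch -= lr * grad
    return stitch


# Toy data: the clean-sample reward is a hypothetical function of the clean latent x0.
x0 = rng.normal(size=(256, LATENT_DIM))
t = rng.uniform(size=256)
noise = rng.normal(size=x0.shape)
x_t = np.sqrt(1 - t)[:, None] * x0 + np.sqrt(t)[:, None] * noise
clean_reward = np.tanh(x0[:, 0])

stitch = train_stitch(x_t, t, clean_reward)
mse_before = np.mean(clean_reward ** 2)  # error of the untrained (all-zero) stitch layer
mse_after = np.mean((value(stitch, x_t, t) - clean_reward) ** 2)
print(f"MSE before: {mse_before:.3f}, after: {mse_after:.3f}")
```

Because both frozen parts stay fixed, the only trainable parameters are those of the stitch layer, which is one plausible reading of why such a value function could be called lightweight. In a real system the stand-ins would be replaced by actual networks and a proper optimizer.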