PulseAugur
EN
LIVE 20:40:15

New diffusion model framework enhances pose-guided person image synthesis

Researchers have developed a new framework called Fusion Embedding for PGPIS using a Diffusion Model (FPDM) to improve the synthesis of person images based on specified poses. This method explicitly aligns fused source-pose embeddings with target image embeddings through contrastive learning, using the learned fusion embedding as a conditioning signal for generation. FPDM integrates an Image-Pose Fusion module to learn these aligned embeddings, guiding a conditional diffusion model with source appearance, target pose, and the fusion embedding. Experiments on benchmark datasets show FPDM enhances texture fidelity and consistency across pose and source variations. AI

IMPACT Improves fidelity and consistency in AI-generated human images for applications like virtual try-on and digital avatars.

RANK_REASON This is a research paper describing a new method for image synthesis. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 English(EN) · Donghwna Lee, Kirok Kim, Jisu Lee, Kyungha Min, Wooju Kim ·

    Fusion Embedding for Pose-Guided Person Image Synthesis with Diffusion Model

    arXiv:2412.07333v2 Announce Type: replace-cross Abstract: Pose-Guided Person Image Synthesis (PGPIS) aims to generate human images in specified poses while preserving the identity and appearance of a source image. This technology facilitates diverse applications, including virtua…