PulseAugur
EN
LIVE 11:48:40

New diffusion model synthesizes novel human views and poses from single image

Researchers have developed a novel diffusion model for synthesizing novel human views and poses from a single image. This approach addresses limitations in existing methods, such as handling complex poses with ambiguous 2D keypoints or inaccuracies in generalizable human NeRFs for occluded parts. The proposed model incorporates 3D human priors, specifically 3D normal maps and color prompts, as conditional inputs to guide the denoising process. This allows for high-quality synthesis, including previously unseen or occluded human body parts, and includes a self-reconstruction-based refinement for enhanced detail. Experimental results on public datasets indicate superior performance and generalization capabilities compared to previous methods. AI

IMPACT This research could advance the capabilities of AI in generating realistic human imagery for applications like virtual try-on, gaming, and content creation.

RANK_REASON Academic paper detailing a new model and methodology. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CV →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New diffusion model synthesizes novel human views and poses from single image

COVERAGE [1]

  1. arXiv cs.CV TIER_1 English(EN) · Shenjian Gong, Kangkan Wang, Shanshan Zhang, Jian Yang ·

    One-Shot Novel View and Pose Human Image Synthesis via 3D Prior Guided Diffusion Model

    arXiv:2606.19718v1 Announce Type: new Abstract: This paper addresses the challenge of one-shot novel view and pose human image synthesis. The existing methods transfer the reference human image to a target pose using a set of 2D pose keypoints or synthesize human images based on …