Researchers have developed a novel diffusion model for synthesizing novel human views and poses from a single image. This approach addresses limitations in existing methods, such as handling complex poses with ambiguous 2D keypoints or inaccuracies in generalizable human NeRFs for occluded parts. The proposed model incorporates 3D human priors, specifically 3D normal maps and color prompts, as conditional inputs to guide the denoising process. This allows for high-quality synthesis, including previously unseen or occluded human body parts, and includes a self-reconstruction-based refinement for enhanced detail. Experimental results on public datasets indicate superior performance and generalization capabilities compared to previous methods. AI
IMPACT This research could advance the capabilities of AI in generating realistic human imagery for applications like virtual try-on, gaming, and content creation.
RANK_REASON Academic paper detailing a new model and methodology. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →