iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
Researchers have introduced iTryOn, a framework for interactive virtual try-on in video. It addresses a limitation of current methods, which overlook cases where subjects actively interact with their clothing. iTryOn uses a video diffusion Transformer with a multi-level interaction injection mechanism that incorporates a 3D hand prior for spatial guidance and global and action captions for semantic guidance.
AI IMPACT Enables more dynamic and controllable virtual try-on experiences by allowing active garment interaction.