Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models
Researchers have developed a framework called Staged Executable Inverse Graphics (SEIG) that uses vision-language models to reconstruct a 3D scene from a single image. The method generates an editable Blender program, so geometry, materials, and lighting can be manipulated without specialized 3D models or multi-view data. The staged reconstruction approach is reported to substantially improve fidelity and to support a range of downstream applications.
IMPACT: Enables more intuitive 3D scene creation and editing from single images, with potential applications in content creation and simulation.
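
To make the idea of an "editable Blender program" concrete, here is a minimal sketch in Python. It is not the SEIG implementation: the `SceneSpec` fields, the three stage functions, and `build_program` are hypothetical names chosen for illustration, and the split into geometry, material, and lighting stages is an assumption based only on the summary above. The sketch emits the text of a small Blender script, which would then need to be run inside Blender (for example from its scripting workspace); the generator itself runs in plain Python.

```python
# Hypothetical sketch: none of these names come from the SEIG work itself.
from dataclasses import dataclass


@dataclass
class SceneSpec:
    """Editable scene parameters, grouped loosely by stage (assumed)."""
    cube_size: float = 2.0
    cube_location: tuple = (0.0, 0.0, 1.0)
    base_color: tuple = (0.8, 0.1, 0.1, 1.0)  # RGBA
    sun_location: tuple = (4.0, -4.0, 6.0)
    sun_energy: float = 3.0


def geometry_stage(spec):
    # Emit lines that add a cube and keep a handle to it.
    return [
        "import bpy",
        f"bpy.ops.mesh.primitive_cube_add(size={spec.cube_size}, "
        f"location={spec.cube_location})",
        "obj = bpy.context.active_object",
    ]


def material_stage(spec):
    # Emit lines that create a material and attach it to the cube.
    return [
        'mat = bpy.data.materials.new(name="BaseMaterial")',
        f"mat.diffuse_color = {spec.base_color}",
        "obj.data.materials.append(mat)",
    ]


def lighting_stage(spec):
    # Emit lines that add a sun light; it becomes the active object.
    return [
        f"bpy.ops.object.light_add(type='SUN', location={spec.sun_location})",
        f"bpy.context.active_object.data.energy = {spec.sun_energy}",
    ]


def build_program(spec):
    """Concatenate the stage outputs into one Blender script (as text)."""
    lines = []
    for stage in (geometry_stage, material_stage, lighting_stage):
        lines.extend(stage(spec))
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_program(SceneSpec()))
```

Because the scene lives in a program rather than a fixed mesh, an edit is a parameter change: `build_program(SceneSpec(sun_energy=5.0))` regenerates the script with brighter lighting and leaves the geometry and material lines untouched.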