Brief · PulseAugur

RESEARCH · Hugging Face Daily Papers English(EN) · 1w · [3 sources]

Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models

Researchers have developed a new framework called Staged Executable Inverse Graphics (SEIG) that uses vision-language models to reconstruct 3D scenes from single images. This method generates editable Blender programs, allowing for manipulation of geometry, materials, and lighting without specialized 3D models or multi-view data. The staged reconstruction approach significantly enhances fidelity, enabling various downstream applications. AI

IMPACT Enables more intuitive 3D scene creation and editing from single images, potentially impacting content creation and simulation.

Blender
vision-language models
Staged Executable Inverse Graphics
Hugging Face
arXiv