English(EN) SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

新方法使用多智能体系统生成和编辑3D室内场景

作者 PulseAugur 编辑部 · [7 个来源] · 2026-06-07 01:38

研究人员开发了生成和编辑3D室内场景的新方法。SceneConductor使用多智能体协同框架将过程分解为初始化、环境构建和精炼阶段，提高了几何精度和真实感。AccioScene采用图扩散和交互驱动的批评者，从文本提示创建连贯的3D场景，侧重于功能合理性和人机交互。HDSL引入了一种用于结构化场景表示的分层领域特定语言，使LLM智能体能够通过局部修订更有效地生成和编辑场景。 AI

影响这些在3D场景生成和编辑方面的进展可以加速游戏、模拟和建筑设计的虚拟环境的开发。

排序理由多篇研究论文介绍了3D场景生成和编辑的新颖方法。

在 arXiv cs.MA (Multiagent) 阅读 →

AI 生成摘要 · Google Gemini · 来自 7 个来源。我们如何撰写摘要 →

报道来源 [7]

arXiv cs.LG TIER_1 English(EN) · Yao Wei, Matteo Toso, Pietro Morerio, Changjae Oh, Michael Ying Yang, Alessio Del Bue · 2026-06-09 04:00

AccioScene：通过图扩散和交互式批评进行组合式3D场景生成

arXiv:2502.06819v2 Announce Type: replace Abstract: This paper presents a framework for generating 3D indoor scenes from text prompts. Existing methods often formulate scene synthesis as an object layout prediction problem conditioned on a single input modality, such as a text de…
arXiv cs.AI TIER_1 English(EN) · Jeonghwan Kim, Yushi Lan, Yongwei Chen, Hieu Trung Nguyen, Chuanyu Pan, Xingang Pan · 2026-06-09 04:00

SceneConductor：通过多智能体编排从单张图像生成3D场景

arXiv:2606.08402v1 Announce Type: cross Abstract: Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object relationships, and environmental context from inherently ambiguous visual evidence. Despite recent progress in joint layout-…
arXiv cs.MA (Multiagent) TIER_1 English(EN) · Xingang Pan · 2026-06-07 01:38

SceneConductor：通过多代理编排从单张图像生成3D场景

Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object relationships, and environmental context from inherently ambiguous visual evidence. Despite recent progress in joint layout-and-mesh generation, existing methods often rely o…
arXiv cs.CV TIER_1 English(EN) · Xinnan Zhu, Ruijie Xu, Jiayu Ying, Daoguo Dong, Jiachen Xu, Yuan Xie, Xin Tan · 2026-06-12 04:00

JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

arXiv:2606.13345v1 Announce Type: new Abstract: Existing 3D scene editing methods typically rely on per-scene optimization over explicit 3D representations or cascaded edit-and-reconstruct pipelines, resulting in high test-time cost, limited 3D awareness, and structural inconsist…
arXiv cs.CV TIER_1 English(EN) · Xin Tan · 2026-06-11 13:35

JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

Existing 3D scene editing methods typically rely on per-scene optimization over explicit 3D representations or cascaded edit-and-reconstruct pipelines, resulting in high test-time cost, limited 3D awareness, and structural inconsistencies. To couple appearance synthesis and geome…
arXiv cs.CV TIER_1 English(EN) · Letian Li, Chao Shen, Shuzhao Xie, Chenghao Gu, ZhengXiao He, Yu Meng, Xin Yang, Wenyuan Jiang, Zhi Wang · 2026-06-09 04:00

HDSL：用于结构化3D室内场景生成和LLM代理本地化编辑的分层领域特定语言

arXiv:2606.09738v1 Announce Type: new Abstract: Text-driven indoor scene generation and editing require an intermediate representation that language models can both produce and revise. Existing LLM-based systems often rely on scene graphs or global constraint lists, which are com…
arXiv cs.CV TIER_1 English(EN) · Zhi Wang · 2026-06-08 17:02

HDSL：用于结构化3D室内场景生成和LLM代理本地化编辑的分层领域特定语言

Text-driven indoor scene generation and editing require an intermediate representation that language models can both produce and revise. Existing LLM-based systems often rely on scene graphs or global constraint lists, which are compact but underspecify local geometry and make in…

报道来源 [7]

AccioScene：通过图扩散和交互式批评进行组合式3D场景生成

SceneConductor：通过多智能体编排从单张图像生成3D场景

SceneConductor：通过多代理编排从单张图像生成3D场景

JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

HDSL：用于结构化3D室内场景生成和LLM代理本地化编辑的分层领域特定语言

HDSL：用于结构化3D室内场景生成和LLM代理本地化编辑的分层领域特定语言

相关实体

相关话题