PulseAugur
SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

New methods use multi-agent systems to generate and edit 3D indoor scenes

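Researchers have developed new methods for generating and editing 3D indoor scenes. SceneConductor uses a multi-agent orchestration framework that decomposes the process into initialization, environment construction, and refinement stages, improving geometric accuracy and realism. AccioScene uses graph diffusion and an interaction-driven critic to create coherent 3D scenes from text prompts, with a focus on functional plausibility and human interaction. HDSL introduces a hierarchical domain-specific language for structured scene representation, letting LLM agents generate and edit scenes more effectively through local revisions.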

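Impact: These advances in 3D scene generation and editing could accelerate the development of virtual environments for games, simulation, and architectural design.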

Ranking rationale: Multiple research papers introduce novel methods for 3D scene generation and editing.

Read on arXiv cs.MA (Multiagent) →

AI-generated summary · Google Gemini · from 7 sources. How we write summaries →

Sources [7]

  1. arXiv cs.LG TIER_1 English(EN) · Yao Wei, Matteo Toso, Pietro Morerio, Changjae Oh, Michael Ying Yang, Alessio Del Bue ·

    AccioScene: Compositional 3D Scene Generation via Graph Diffusion and Interactive Critique

    arXiv:2502.06819v2 Announce Type: replace Abstract: This paper presents a framework for generating 3D indoor scenes from text prompts. Existing methods often formulate scene synthesis as an object layout prediction problem conditioned on a single input modality, such as a text de…

  2. arXiv cs.AI TIER_1 English(EN) · Jeonghwan Kim, Yushi Lan, Yongwei Chen, Hieu Trung Nguyen, Chuanyu Pan, Xingang Pan ·

    SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

    arXiv:2606.08402v1 Announce Type: cross Abstract: Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object relationships, and environmental context from inherently ambiguous visual evidence. Despite recent progress in joint layout-…

  3. arXiv cs.MA (Multiagent) TIER_1 English(EN) · Xingang Pan ·

    SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

    Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object relationships, and environmental context from inherently ambiguous visual evidence. Despite recent progress in joint layout-and-mesh generation, existing methods often rely o…

  4. arXiv cs.CV TIER_1 English(EN) · Xinnan Zhu, Ruijie Xu, Jiayu Ying, Daoguo Dong, Jiachen Xu, Yuan Xie, Xin Tan ·

    JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

    arXiv:2606.13345v1 Announce Type: new Abstract: Existing 3D scene editing methods typically rely on per-scene optimization over explicit 3D representations or cascaded edit-and-reconstruct pipelines, resulting in high test-time cost, limited 3D awareness, and structural inconsist…

  5. arXiv cs.CV TIER_1 English(EN) · Xin Tan ·

    JointEdit3D: Feed-Forward 3D Scene Editing in a Unified Latent Space

    Existing 3D scene editing methods typically rely on per-scene optimization over explicit 3D representations or cascaded edit-and-reconstruct pipelines, resulting in high test-time cost, limited 3D awareness, and structural inconsistencies. To couple appearance synthesis and geome…

  6. arXiv cs.CV TIER_1 English(EN) · Letian Li, Chao Shen, Shuzhao Xie, Chenghao Gu, ZhengXiao He, Yu Meng, Xin Yang, Wenyuan Jiang, Zhi Wang ·

    HDSL: A Hierarchical Domain-Specific Language for Structured 3D Indoor Scene Generation and Localized Editing with LLM Agents

    arXiv:2606.09738v1 Announce Type: new Abstract: Text-driven indoor scene generation and editing require an intermediate representation that language models can both produce and revise. Existing LLM-based systems often rely on scene graphs or global constraint lists, which are com…

  7. arXiv cs.CV TIER_1 English(EN) · Zhi Wang ·

    HDSL: A Hierarchical Domain-Specific Language for Structured 3D Indoor Scene Generation and Localized Editing with LLM Agents

    Text-driven indoor scene generation and editing require an intermediate representation that language models can both produce and revise. Existing LLM-based systems often rely on scene graphs or global constraint lists, which are compact but underspecify local geometry and make in…