PulseAugur
English(EN) SpatialGrammar: A Domain-Specific Language for LLM-Based 3D Indoor Scene Generation

New LLM Techniques and Benchmarks Advance 3D Indoor Scene Generation

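Researchers have developed new AI methods for generating 3D indoor scenes that address challenges such as spatial errors and data scarcity. One method, SpatialGrammar, introduces a domain-specific language for representing layouts and uses a closed-loop system with compiler feedback to check constraints and ensure physical plausibility. Another, CasLayout, uses a cascaded diffusion framework that decomposes scene generation into sub-stages, which makes it easier to integrate LLMs and VLMs and improves control over object relations. Separately, two new benchmarks, C-Bench and O-Bench, were introduced to evaluate layout-guided text-to-image diffusion models more comprehensively.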
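The closed-loop idea attributed to SpatialGrammar above (generate a layout, check it, feed the errors back) can be illustrated with a minimal Python sketch. This is not the paper's DSL or compiler: the `Box` type, `check_layout`, `generate_with_feedback`, and the `propose` callback (standing in for an LLM call) are all hypothetical names, and the checker covers only 2D room bounds and footprint overlap.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Box:
    """Hypothetical axis-aligned footprint of one object on the floor plane (metres)."""
    name: str
    x: float  # min corner along x
    y: float  # min corner along y
    w: float  # extent along x
    d: float  # extent along y


def overlaps(a: Box, b: Box) -> bool:
    # Two rectangles collide when their intervals overlap on both axes.
    return (a.x < b.x + b.w and b.x < a.x + a.w
            and a.y < b.y + b.d and b.y < a.y + a.d)


def check_layout(boxes, room_w, room_d):
    """Return diagnostic strings; an empty list means the layout passes."""
    errors = []
    for b in boxes:
        if b.x < 0 or b.y < 0 or b.x + b.w > room_w or b.y + b.d > room_d:
            errors.append(f"{b.name}: outside room bounds")
    for a, b in combinations(boxes, 2):
        if overlaps(a, b):
            errors.append(f"{a.name} collides with {b.name}")
    return errors


def generate_with_feedback(propose, room_w, room_d, max_rounds=3):
    """Call propose(feedback) until the checker reports no errors or rounds run out.

    `propose` is an assumed stand-in for the generator (e.g. an LLM prompt
    that includes the previous round's diagnostics).
    """
    layout, feedback = [], []
    for _ in range(max_rounds):
        layout = propose(feedback)
        feedback = check_layout(layout, room_w, room_d)
        if not feedback:
            break
    return layout, feedback
```

In use, `propose` would receive the diagnostics from the previous round and return a revised list of boxes; the loop stops as soon as the checker returns no errors, or gives up after `max_rounds` and returns the last diagnostics alongside the layout.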

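Impact: The new techniques and evaluation frameworks aim to improve the fidelity and controllability of AI-generated 3D indoor environments.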

Ranking rationale: Multiple research papers introduce new methods and benchmarks for AI-driven 3D indoor scene generation.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 6 sources. How we write summaries →

Sources [6]

  1. arXiv cs.AI TIER_1 English(EN) · Song Tang, Kaiyong Zhao, Yuliang Li, Qingsong Yan, Penglei Sun, Junyi Zou, Qiang Wang, Xiaowen Chu ·

    SpatialGrammar: A Domain-Specific Language for LLM-Based 3D Indoor Scene Generation

    arXiv:2604.27555v1 Announce Type: new Abstract: Automatically generating interactive 3D indoor scenes from natural language is crucial for virtual reality, gaming, and embodied AI. However, existing LLM-based approaches often suffer from spatial errors and collisions, in part bec…

  2. Hugging Face Daily Papers TIER_1 English(EN) ·

    CasLayout: Cascaded 3D Layout Diffusion for Indoor Scene Synthesis with Implicit Relation Modeling

    Synthesizing realistic 3D indoor scenes remains challenging due to data scarcity and the difficulty of simultaneously enforcing global architectural constraints and local semantic consistency. Existing approaches often overlook structural boundaries or rely on fully connected rel…

  3. Hugging Face Daily Papers TIER_1 English(EN) ·

    Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings

    Evaluating layout-guided text-to-image generative models requires assessing both semantic alignment with textual prompts and spatial fidelity to prescribed layouts. Assessing layout alignment requires collecting fine-grained annotations, which is costly and labor-intensive. Conse…

  4. arXiv cs.CV TIER_1 English(EN) · Yingrui Wu, Youkang Kong, Mingyang Zhao, Weize Quan, Dong-Ming Yan, Yang Liu ·

    CasLayout: Cascaded 3D Layout Diffusion for Indoor Scene Synthesis with Implicit Relation Modeling

    arXiv:2604.27361v1 Announce Type: new Abstract: Synthesizing realistic 3D indoor scenes remains challenging due to data scarcity and the difficulty of simultaneously enforcing global architectural constraints and local semantic consistency. Existing approaches often overlook stru…

  5. arXiv cs.CV TIER_1 English(EN) · Luca Parolari, Nicla Faccioli, Lamberto Ballan ·

    Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings

    arXiv:2604.25358v1 Announce Type: new Abstract: Evaluating layout-guided text-to-image generative models requires assessing both semantic alignment with textual prompts and spatial fidelity to prescribed layouts. Assessing layout alignment requires collecting fine-grained annotat…

  6. arXiv cs.CV TIER_1 English(EN) · Lamberto Ballan ·

    Benchmarking Layout-Guided Diffusion Models through Unified Semantic-Spatial Evaluation in Closed and Open Settings

    Evaluating layout-guided text-to-image generative models requires assessing both semantic alignment with textual prompts and spatial fidelity to prescribed layouts. Assessing layout alignment requires collecting fine-grained annotations, which is costly and labor-intensive. Conse…