PulseAugur
实时 01:42:53
English(EN) AmaraSpatial-10K: A Spatially and Semantically Aligned 3D Dataset for Spatial Computing and Embodied AI

新数据集旨在提高具身人工智能的语言多样性和空间对齐性

两个新数据集旨在通过解决现有数据的局限性来改进具身人工智能研究。一篇题为“具身人工智能数据集中的语言多样性有限”的论文审计了当前的语料库,发现它们经常使用重复的、模板化的命令,这表明需要更广泛的语言覆盖。另一篇题为“AmaraSpatial-10K”的论文介绍了一个包含超过10,000个合成3D资产的数据集,这些资产是按度量缩放和语义对齐的,专为在具身人工智能和机器人模拟中直接使用而设计。 AI

影响 新数据集解决了具身人工智能中的数据局限性,有可能提高模型性能并实现更复杂的模拟。

排序理由 两篇学术论文介绍了用于具身人工智能研究的新数据集和分析。

在 arXiv cs.CV 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

新数据集旨在提高具身人工智能的语言多样性和空间对齐性

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Selma Wanna, Agnes Luhtaru, Jonathan Salfity, Ryan Barron, Juston Moore, Cynthia Matuszek, Mitch Pryor ·

    Limited Linguistic Diversity in Embodied AI Datasets

    arXiv:2601.03136v2 Announce Type: replace Abstract: Language plays a critical role in Vision-Language-Action (VLA) models, yet the linguistic characteristics of the datasets used to train and evaluate these systems remain poorly documented. In this work, we present a systematic d…

  2. arXiv cs.CV TIER_1 English(EN) · Mohammad Sadegh Salehi, Alex Perkins, Igor Maurell, Ashkan Dabbagh, Raymond Wong ·

    AmaraSpatial-10K: A Spatially and Semantically Aligned 3D Dataset for Spatial Computing and Embodied AI

    arXiv:2604.23018v1 Announce Type: new Abstract: Web-scale 3D asset collections are abundant, but rarely deployment-ready. Assets ship with arbitrary metric scale, incorrect pivots and forward axes, brittle geometry, and textures that do not support relighting, which limits their …