The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning

New Large Language Model Research Strengthens Spatial Reasoning Beyond Symbolic Patterns

By the PulseAugur editorial team · [9 sources] · 2026-05-29 15:09

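Researchers are developing new methods to improve the spatial reasoning of large language models (LLMs) by moving beyond symbolic pattern matching toward genuine geometric understanding. One approach introduces Spatial Language Models (SLMs), which treat location as a first-class modality and are trained and evaluated with dedicated datasets and benchmarks. Another, Imaginative Perception Tokens (IPT), augments multimodal models by letting them infer unseen spatial configurations, which improves performance on tasks such as path tracing and multi-view counting. Further studies examine the effect of linguistic bias and the importance of metric-space grounding for LLM spatial prediction.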

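Impact: These advances aim to equip large language models with stronger geometric and imaginative spatial reasoning that goes beyond superficial pattern matching.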

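Ranking rationale: Multiple research papers introduce new techniques and benchmarks for improving spatial reasoning in large language models.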


AI-generated summary · Google Gemini · from 9 sources.

Sources [9]

arXiv cs.AI TIER_1 English(EN) · Chen Chu, Bita Azarijoo, Li Xiong, Khurram Shafique, Cyrus Shahabi · 2026-06-04 04:00

From Symbolic to Geometric: Enabling Spatial Reasoning in Large Language Models

arXiv:2606.04381v1 Announce Type: cross Abstract: Recent large language models (LLMs) often appear to exhibit spatial reasoning ability; however, this capability is largely symbolic, arising from pattern matching over spatial language rather than true geometric reas…
arXiv cs.AI TIER_1 English(EN) · Mahtab Bigverdi, Lindsey Li, Weikai Huang, Yiming Liu, Jaemin Cho, Jieyu Zhang, Tuhin Kundu, Chris Dangjoo Kim, Zelun Luo, Linda Shapiro, Ranjay Krishna · 2026-06-03 04:00

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

arXiv:2606.03988v1 Announce Type: new Abstract: Vision language models (VLMs) excel at many tasks but still struggle with spatial reasoning when critical information is not directly observable. Many such problems require imaginative perception: inferring what would be seen from a…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-03 02:54

From Symbolic to Geometric: Enabling Spatial Reasoning in Large Language Models

Recent large language models (LLMs) often appear to exhibit spatial reasoning ability; however, this capability is largely symbolic, arising from pattern matching over spatial language rather than true geometric reasoning over space. Because LLMs operate on discrete…
arXiv cs.AI TIER_1 English(EN) · Ranjay Krishna · 2026-06-02 17:59

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Vision language models (VLMs) excel at many tasks but still struggle with spatial reasoning when critical information is not directly observable. Many such problems require imaginative perception: inferring what would be seen from an unseen viewpoint, tracing paths through occlud…
arXiv cs.IR (Information Retrieval) TIER_1 English(EN) · Shuigeng Zhou · 2026-06-02 14:47

When Does Latent Reasoning Help? MeRa: Metric-Space Bias for Spatial Prediction

Latent reasoning has improved sequential recommendation by iteratively refining representations before prediction, but does it help spatial prediction? We find that the answer depends on whether reasoning is grounded in the underlying metric space. Without such grounding, latent …
arXiv cs.CL TIER_1 English(EN) · Chuang Ma, Qianying Liu, Tomoyuki Obuchi, Fei Cheng, Wang Yang, Sudong Cai, Shuyuan Zheng, Akiko Aizawa, Sadao Kurohashi · 2026-06-02 04:00

Mechanistic Diagnostics of Spatial Lexical Bias in Multimodal Large Language Model Spatial Reasoning

arXiv:2606.01914v1 Announce Type: new Abstract: Multimodal large language models (MLLMs) remain unreliable on spatial multiple-choice questions, and their failures are often attributed to poorly attended visual information. In this work, we identify a complementary failure mode, …
arXiv cs.CL TIER_1 English(EN) · Sadao Kurohashi · 2026-06-01 08:49

Mechanistic Diagnostics of Spatial Lexical Bias in Multimodal Large Language Model Spatial Reasoning

Multimodal large language models (MLLMs) remain unreliable on spatial multiple-choice questions, and their failures are often attributed to poorly attended visual information. In this work, we identify a complementary failure mode, spatial lexical bias: adding a spatial relation …
arXiv cs.AI TIER_1 English(EN) · Xudong Zhang, Jian Yang, Shengkai Wang, Jiangpeng Tian, Shaowen Chen, Xian Wei, Ke Li, Xiong You · 2026-06-01 04:00

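The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning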

arXiv:2605.31404v1 Announce Type: cross Abstract: Large Language Model (LLM)-based navigation systems commonly construct explicit spatial representations (e.g., topological graphs, semantic raster maps) and translate them into textual descriptions as LLMs' inputs. However, the li…
arXiv cs.AI TIER_1 English(EN) · Xiong You · 2026-05-29 15:09

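The Sword, Shield, and Achilles' Heel: Characterizing the Linguistic Inductive Bias of Large Language Models for Spatial Reasoning in Navigation Planning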

Large Language Model (LLM)-based navigation systems commonly construct explicit spatial representations (e.g., topological graphs, semantic raster maps) and translate them into textual descriptions as LLMs' inputs. However, the linguistic structures of such text-based spatial rep…
