LLMs struggle with visual reasoning in engineering statics problems

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-26 04:00

一项新发表在arXiv上的研究调查了大型语言模型（LLMs）的解决问题能力，特别关注工程教育中的静力学问题。研究人员使用ChatGPT进行了模型蒸馏过程，创建了一个包含25个纯文本静力学问题的数据集，以及另外两个包含图表和修改数值的数据集。研究结果表明，虽然LLMs在基于文本的静力学问题上表现良好，但在引入图表时其准确性显著下降，这表明在多步推理和整合视觉信息方面存在困难。 AI

影响强调了LLMs整合视觉信息和进行多步推理的能力限制，指出了人工智能在工程教育未来发展方向。

排序理由该集群包含一篇详细调查LLM能力的 연구 논문. [lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

LLMs struggle with visual reasoning in engineering statics problems

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Tanner Culleton, Hung-Fu Chang · 2026-06-26 04:00

调查LLM解决问题能力——一项关于静力学问题的研究

arXiv:2606.26103v1 Announce Type: cross Abstract: Large Language Models (LLMs) have rapidly influenced many aspects of society, particularly education, due to their demonstrated ability to complete assignments and examinations across a wide range of subjects. Although prior studi…

报道来源 [1]

调查LLM解决问题能力——一项关于静力学问题的研究

相关实体

相关话题