English(EN) Math Education Digital Shadows for facilitating learning with LLMs: Math performance, anxiety and confidence in simulated students and AIs

新的MEDS数据集描绘了大型语言模型在数学推理、偏见和态度

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-30 09:08

研究人员推出了MEDS（数学教育数字阴影），一个旨在评估大型语言模型在数学方面的表现并识别潜在偏见的新数据集。MEDS包含14个大型语言模型中的28,000个角色，模拟了人类和AI助手的互动。它超越了传统的基准测试，纳入了自我效能感、数学焦虑和认知网络以及熟练度分数等指标。 AI

影响为评估大型语言模型的数学能力和偏见提供了一个新数据集，有助于开发更安全的AI导师。

排序理由该集群描述了在arXiv上发布的新数据集和研究论文。

在 arXiv cs.LG 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Naomi Esposito, Anthony Tricarico, Luisa Porzio, Ali Aghazadeh Ardebili, Massimo Stella · 2026-05-01 04:00

用于促进LLM学习的数学教育数字阴影：模拟学生和AI的数学表现、焦虑和信心

arXiv:2604.27618v1 Announce Type: new Abstract: To enhance LLMs' impact on math education, we need data on their mathematical prowess and biases across prompts. To fill this gap, we introduce MEDS (Math Education Digital Shadows) as a dataset mapping how large language models rea…
arXiv cs.LG TIER_1 English(EN) · Massimo Stella · 2026-04-30 09:08

促进学习的数学教育数字阴影与大型语言模型：模拟学生和人工智能的数学表现、焦虑和信心

To enhance LLMs' impact on math education, we need data on their mathematical prowess and biases across prompts. To fill this gap, we introduce MEDS (Math Education Digital Shadows) as a dataset mapping how large language models reason about and report mathematics across human- a…

报道来源 [2]

用于促进LLM学习的数学教育数字阴影：模拟学生和AI的数学表现、焦虑和信心

促进学习的数学教育数字阴影与大型语言模型：模拟学生和人工智能的数学表现、焦虑和信心

相关实体

相关话题