New Benchmark Framework Simulates Imperfect Students with LLMs

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-25 08:54

研究人员引入了一个使用大型语言模型模拟不完美学生的创新框架，旨在辅助教师培训。所提出的方法使用显式的技能向量和基于提示的控制来引导 LLM 的行为，从而能够模拟具有特定保留和抑制能力的学生的行为。虽然初步结果证明了在结构化数学环境中诱导和测量选择性部分掌握的可行性，但可控性程度被发现取决于所使用的特定语言模型。 AI

影响这项研究可以实现更现实、更可控的由人工智能驱动的模拟，用于教师培训，从而改善教育实践。

排序理由该集群包含一篇学术论文，详细介绍了用于控制 LLM 行为的新研究框架和基准。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

New Benchmark Framework Simulates Imperfect Students with LLMs

报道来源 [2]

arXiv cs.AI TIER_1 English(EN) · Alexander Apartsin, Omri Sason, Yehudit Aperstein · 2026-05-26 04:00

Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models

arXiv:2605.25601v1 Announce Type: cross Abstract: Teacher education requires deliberate practice with learners who exhibit identifiable strengths, weaknesses, and partial mastery. Large language models could support such practice by simulating students with known skill components…
arXiv cs.CL TIER_1 English(EN) · Yehudit Aperstein · 2026-05-25 08:54

Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models

Teacher education requires deliberate practice with learners who exhibit identifiable strengths, weaknesses, and partial mastery. Large language models could support such practice by simulating students with known skill components, enabling teachers to rehearse explanations, diag…

报道来源 [2]

Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models

Toward a Benchmark for Controllable Simulation of Imperfect Students with Large Language Models

相关实体

相关话题