新模型评估学生AI推理能力：构建、评判、引导

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-04 10:25

研究人员推出了CoRe-3，一个旨在评估学生与生成式AI推理能力的新能力模型。该模型将AI交互分解为三个不同的技能：构建（为AI定义任务）、评判（评估AI的输出）和引导（迭代地指导AI）。其目标是超越简单的提示评分，实现对教育中AI生产性使用的更细致的理解。 AI

影响为教育工作者提供了一个评估和改进学生与AI工具批判性互动能力的框架。

排序理由该集群包含一篇学术论文，详细介绍了用于评估AI推理能力的新模型和平台。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Alexander Apartsin, Yehudit Aperstein · 2026-06-05 04:00

构建、评判、引导：一项可评估的能力模型，用于教导学生使用生成式AI进行推理

arXiv:2606.05983v1 Announce Type: cross Abstract: Generative AI makes answers easy and understanding hard, and uncritical use invites cognitive offloading. Schools still measure unaided performance, yet the real task is to produce good work with AI: framing an ill-defined task, j…
arXiv cs.CL TIER_1 English(EN) · Yehudit Aperstein · 2026-06-04 10:25

构建、评判、引导：一项可评估的能力模型，用于教导学生使用生成式AI进行推理

Generative AI makes answers easy and understanding hard, and uncritical use invites cognitive offloading. Schools still measure unaided performance, yet the real task is to produce good work with AI: framing an ill-defined task, judging the output, and steering the model toward a…