English(EN) From Scoring to Explanations: Evaluating SHAP and LLM Rationales for Rubric-based Teaching Quality Assessment

新框架使用SHAP和LLM来解释教学质量分数

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-05 04:00

研究人员开发了一个新框架，用于解释自动化评分模型如何为复杂的语言表现（如课堂记录）分配质量评级。该框架结合了模型无关的Shapley值归因和大型语言模型（LLM）生成的解释。在CLASS框架的反馈质量维度上的测试中，Shapley值在解释模型预测方面比LLM生成的理由更可靠和可转移。 AI

影响为评估教育评估中AI模型解释的忠实度和可转移性提供了一种更稳健的方法。

排序理由该集群包含一篇学术论文，详细介绍了用于评估LLM理由和SHAP在基于评分标准的评估中的新框架。[lever_c_demoted from research: ic=1 ai=1.0]

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Ivo Bueno, Babette B\"uhler, Philipp Stark, Tim F\"utterer, Ulrich Trautwein, Dorottya Demszky, Heather Hill, Enkelejda Kasneci · 2026-06-05 04:00

从评分到解释：评估SHAP和LLM的理由在基于评分标准的教学质量评估中的应用

arXiv:2606.05180v1 Announce Type: new Abstract: Automated scoring models are increasingly used to assign rubric-based quality ratings to complex language performances, including classroom transcripts, yet they typically provide little insight into why a particular score is produc…