PulseAugur
实时 12:44:54
English(EN) Automated Essay Scoring and Language Certification: Assessing Generalizability, Agreement and Validity for French

新框架提升法语语言测试的自动化作文评分

研究人员为自动化作文评分(AES)系统开发了一个增强的基于论证的验证(ABV)框架,专注于法语语言测试。这个改进的框架包括公平性分析、语言特征相关性、预测误差评估以及模型与人类评分者的一致性。该研究将此框架应用于比较八种不同的模型架构,使用了一个大型法语作文语料库,旨在更全面地理解AES模型的能力和局限性。 AI

影响 为用于高风险语言评估的AI系统提供了一个更稳健的评估方法。

排序理由 该集群包含一篇详细介绍新框架及其在自动化作文评分中应用的学术论文。

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。 我们如何撰写摘要 →

报道来源 [2]

  1. arXiv cs.CL TIER_1 English(EN) · Rodrigo Wilkens, R\'emi Cardon, Vincent Folny, Thomas Fran\c{c}ois ·

    Automated Essay Scoring and Language Certification: Assessing Generalizability, Agreement and Validity for French

    arXiv:2606.02009v1 Announce Type: new Abstract: In Automated Essay Scoring (AES), benchmarking practices have fostered minimalist evaluation practices, in contrast with the broader-view recommendations of evaluation frameworks, such as the argument-based validation framework (ABV…

  2. arXiv cs.CL TIER_1 English(EN) · Thomas François ·

    Automated Essay Scoring and Language Certification: Assessing Generalizability, Agreement and Validity for French

    In Automated Essay Scoring (AES), benchmarking practices have fostered minimalist evaluation practices, in contrast with the broader-view recommendations of evaluation frameworks, such as the argument-based validation framework (ABV), which argued in favor of a multidimensional a…