Researchers have compared the effectiveness of Large Language Models (LLMs) and Small Language Models (SLMs) for designing educational assessment questions. The study found that SLMs can perform comparably to LLMs on various pedagogical quality dimensions while offering advantages in privacy and local deployment. However, the research also found that model-based evaluations can be inconsistent and biased compared to expert human judgment, which underscores the need for human oversight in assessment workflows.
Impact: SLMs offer a viable, privacy-preserving alternative for AI-assisted educational assessment design, though human oversight remains crucial.
Ranking rationale: Academic paper detailing a systematic comparison of LLMs and SLMs for a specific task.
AI-generated summary · Google Gemini · from 1 source.