PulseAugur
LIVE 03:48:09
tool · [1 source] ·
0
tool

New K-12 knowledge graph benchmarks LLM curriculum cognition

Researchers have developed K12-KGraph, a novel knowledge graph designed to evaluate and train large language models (LLMs) specifically for K-12 education. This graph, derived from official textbooks, captures curriculum structure, including prerequisites and concept relationships, going beyond simple factual recall. To support this, they created K12-Bench, a 23,640-question benchmark, and K12-Train, a fine-tuning dataset. Experiments show current LLMs struggle with curriculum cognition, and the K12-Train dataset significantly improves performance on educational benchmarks with high sample efficiency. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Establishes a new benchmark for evaluating LLM understanding of educational curricula, potentially driving development of more pedagogically aware AI.

RANK_REASON The cluster describes a new academic paper introducing a novel dataset and benchmark for evaluating LLMs in an educational context. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.CL →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 · Wentao Zhang ·

    K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs

    Large language models (LLMs) are increasingly used in K-12 education, yet existing benchmarks such as C-Eval, CMMLU, GaokaoBench, and EduEval mainly evaluate factual recall through exam-style question answering. Effective educational AI additionally requires curriculum cognition:…