New K-12 knowledge graph benchmarks LLM curriculum cognition

By PulseAugur Editorial · [1 source] · 2026-05-10 16:24

Researchers have developed K12-KGraph, a knowledge graph designed to evaluate and train large language models (LLMs) for K-12 education. Derived from official textbooks, the graph captures curriculum structure, including prerequisites and relationships between concepts, rather than only the facts needed for recall. Built on it are K12-Bench, a 23,640-question benchmark, and K12-Train, a fine-tuning dataset. Experiments show that current LLMs struggle with curriculum cognition, and that fine-tuning on K12-Train significantly improves performance on educational benchmarks with high sample efficiency.
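To make the idea of prerequisite structure concrete, here is a minimal Python sketch of a curriculum graph. It is an illustration only: the concept names, the `PREREQUISITES` mapping, and the helper functions `learning_order` and `all_prerequisites` are hypothetical and are not taken from K12-KGraph, whose actual schema the summary does not describe.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical concepts and edges, invented for illustration only.
# Each key maps a concept to the set of concepts it directly depends on.
PREREQUISITES = {
    "counting": set(),
    "addition": {"counting"},
    "multiplication": {"addition"},
    "fractions": {"multiplication"},
}


def learning_order(prereqs):
    """Return concepts in an order where every prerequisite comes first."""
    return list(TopologicalSorter(prereqs).static_order())


def all_prerequisites(concept, prereqs):
    """Collect the direct and indirect prerequisites of a concept."""
    seen = set()
    stack = list(prereqs.get(concept, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(prereqs.get(current, ()))
    return seen


order = learning_order(PREREQUISITES)
print(order)
print(sorted(all_prerequisites("fractions", PREREQUISITES)))
```

A question that tests curriculum cognition, as opposed to factual recall, might ask which concepts a student must master before fractions; in this toy graph the answer is what `all_prerequisites("fractions", PREREQUISITES)` returns.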

IMPACT Establishes a new benchmark for evaluating LLM understanding of educational curricula, potentially driving development of more pedagogically aware AI.

RANK_REASON The cluster describes a new academic paper introducing a novel dataset and benchmark for evaluating LLMs in an educational context.

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

arXiv cs.CL TIER_1 English(EN) · Wentao Zhang · 2026-05-10 16:24

K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs

Large language models (LLMs) are increasingly used in K-12 education, yet existing benchmarks such as C-Eval, CMMLU, GaokaoBench, and EduEval mainly evaluate factual recall through exam-style question answering. Effective educational AI additionally requires curriculum cognition:…
