Ensuring Reliability in Programming Knowledge Tracing: A Re-evaluation of Attention-augmented Models and Experimental Protocols
A new study re-evaluates attention-augmented models for Programming Knowledge Tracing (PKT), finding that their reported performance gains are highly sensitive to experimental design choices. The research highlights issues with attention dimension settings and temporal causality violations due to improper ordering of student attempts. By implementing a controlled evaluation protocol, the study demonstrates a significantly reduced performance gap between complex attention-enhanced models and standard Deep Knowledge Tracing (DKT) models, suggesting that increased architectural complexity does not consistently yield superior results. AI
IMPACT Provides practical guidance for reliable and comparable evaluation in programming knowledge tracing, potentially impacting how educational AI models are assessed.