Researchers have investigated how Large Language Models (LLMs) represent essay quality internally. Using methods like linear probing and neuron-level analyses on eight different LLMs across multiple datasets, they found that information about essay quality is encoded in a linearly accessible form within the models' representations. This information emerges progressively through the model's layers and shows some transferability across different prompts and scoring rubrics. The study also identified specific neurons that correlate strongly with essay scores and whose behavior changes based on essay length. AI
IMPACT Provides insights into the interpretability of LLMs for automated essay scoring, suggesting structured representations of quality are present.
RANK_REASON Academic paper detailing research findings on LLM internal representations. [lever_c_demoted from research: ic=1 ai=1.0]
- ASAP++
- Automated Essay Scoring
- Center for Spiritual and Ethical Education
- cross-prompt generalization
- dimensionality reduction
- Enem
- Hugging Face
- large-language models
- linear probing
- neuron-level analyses
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →