Interpretable Coreference Resolution Evaluation Using Explicit Semantics
Researchers have developed a new evaluation framework for coreference resolution that goes beyond aggregate statistical metrics. This semantically-enhanced approach uses Concept and Named Entity Recognition to assign semantic labels to mentions and clusters, allowing for evaluation stratified by semantic class like people, locations, or events. Experiments on datasets such as OntoNotes show this method uncovers systematic weaknesses not visible with traditional metrics and can inform targeted data augmentation for improved out-of-domain performance. AI
IMPACT Provides deeper diagnostic insights into NLP model performance, enabling more targeted improvements and data augmentation strategies.