A Reproducible Universal Dependencies-Style Pipeline for Katharevousa Greek Parliamentary Text
Researchers have developed a new, reproducible pipeline for creating a Universal Dependencies-style parsing resource for Katharevousa Greek parliamentary text. This workflow addresses the limitations of current NLP tools for historical Greek documents, integrating OCR reconstruction, LLM-assisted annotation, and automated validation. The resulting dataset and methodology aim to make historical parliamentary archives more accessible for NLP research.
IMPACT: Enables better NLP analysis of historical Greek parliamentary documents, potentially unlocking new research in linguistics and history.
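
The summary above mentions automated validation of Universal Dependencies-style annotations but gives no detail on how it is done. As a rough illustration only, the sketch below shows the kind of structural check such a step might perform on a sentence in CoNLL-U, the standard 10-column, tab-separated format used by Universal Dependencies. The function name `validate_sentence`, the specific checks, and the three-word example sentence are all assumptions made for this illustration; they are not taken from the researchers' pipeline or dataset.

```python
from typing import List

CONLLU_COLUMNS = 10  # ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC


def validate_sentence(lines: List[str]) -> List[str]:
    """Return a list of structural problems in one CoNLL-U sentence block.

    Hypothetical helper for illustration; not the pipeline's own validator.
    """
    errors: List[str] = []
    rows = []
    for n, line in enumerate(lines, start=1):
        # Skip blank lines and sentence-level comments.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != CONLLU_COLUMNS:
            errors.append(f"line {n}: expected 10 columns, found {len(cols)}")
            continue
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        rows.append((n, cols))

    ids = [int(cols[0]) for _, cols in rows]
    if ids != list(range(1, len(ids) + 1)):
        errors.append("token IDs are not consecutive starting at 1")

    roots = 0
    for n, cols in rows:
        head, deprel = cols[6], cols[7]
        if not head.isdigit() or int(head) > len(ids):
            errors.append(f"line {n}: HEAD {head!r} is out of range")
            continue
        if int(head) == int(cols[0]):
            errors.append(f"line {n}: token is its own head")
        if int(head) == 0:
            roots += 1
            if deprel != "root":
                errors.append(f"line {n}: HEAD is 0 but DEPREL is {deprel!r}")
    if roots != 1:
        errors.append(f"expected exactly one root, found {roots}")
    return errors


def make_row(idx, form, lemma, upos, head, deprel):
    """Build one CoNLL-U token line, leaving unused columns as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Invented example sentence, for illustration only (not from the dataset).
good_sentence = [
    "# text = Ἡ Βουλὴ ἀποφασίζει",
    make_row(1, "Ἡ", "ὁ", "DET", 2, "det"),
    make_row(2, "Βουλὴ", "βουλή", "NOUN", 3, "nsubj"),
    make_row(3, "ἀποφασίζει", "ἀποφασίζω", "VERB", 0, "root"),
]

# Same sentence with a deliberately broken HEAD on the last token.
bad_sentence = good_sentence[:3] + [make_row(3, "ἀποφασίζει", "ἀποφασίζω", "VERB", 7, "root")]

if __name__ == "__main__":
    print(validate_sentence(good_sentence))
    print(validate_sentence(bad_sentence))
```

A check like this only catches structural errors (column count, ID sequence, head range, root count). It says nothing about whether a given label is linguistically correct, which is presumably why a real workflow would pair it with further review.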