Researchers have developed a new, reproducible pipeline for creating a Universal Dependencies-style parsing resource for Katharevousa Greek parliamentary text. This workflow addresses the limitations of current NLP tools for historical Greek documents, integrating OCR reconstruction, LLM-assisted annotation, and automated validation. The resulting dataset and methodology aim to make historical parliamentary archives more accessible for NLP research.
AI IMPACT Enables better NLP analysis of historical Greek parliamentary documents, potentially unlocking new research in linguistics and history.
RANK_REASON The cluster contains an academic paper detailing a new methodology and dataset for NLP tasks on historical Greek text.
AI-generated summary · Google Gemini · from 2 sources.