PulseAugur
EN
LIVE 19:03:10

New pipeline creates NLP resource for historical Greek parliamentary text

Researchers have developed a new, reproducible pipeline for creating a Universal Dependencies-style parsing resource for Katharevousa Greek parliamentary text. This workflow addresses the limitations of current NLP tools for historical Greek documents, integrating OCR reconstruction, LLM-assisted annotation, and automated validation. The resulting dataset and methodology aim to make historical parliamentary archives more accessible for NLP research. AI

IMPACT Enables better NLP analysis of historical Greek parliamentary documents, potentially unlocking new research in linguistics and history.

RANK_REASON The cluster contains an academic paper detailing a new methodology and dataset for NLP tasks on historical Greek text.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

COVERAGE [2]

  1. arXiv cs.CL TIER_1 · George Mikros, Fotios Fitsilis ·

    A Reproducible Universal Dependencies-Style Pipeline for Katharevousa Greek Parliamentary Text

    arXiv:2605.22978v1 Announce Type: new Abstract: Katharevousa Greek remains poorly served by contemporary NLP pipelines despite its importance for legal, administrative, and parliamentary archives. We present a reproducible workflow for building and evaluating a Universal Dependen…

  2. arXiv cs.CL TIER_1 · Fotios Fitsilis ·

    A Reproducible Universal Dependencies-Style Pipeline for Katharevousa Greek Parliamentary Text

    Katharevousa Greek remains poorly served by contemporary NLP pipelines despite its importance for legal, administrative, and parliamentary archives. We present a reproducible workflow for building and evaluating a Universal Dependencies-style parsing resource for Katharevousa par…