PulseAugur

Open Diachronic Greek Treebank Released with Indo-European Parallels

Researchers have developed AthDGC, a comprehensive, open-source dataset and workflow for dependency parsing of the Greek language across eight historical periods. This project, built upon the PROIEL Treebank Family schema, includes verse-level cross-alignment with texts in Latin, Gothic, Old Church Slavonic, and Classical Armenian. The current release (v0.4) offers curated samples and an open-source toolkit, with the full annotated corpus undergoing audit for a future release.

RANK_REASON The cluster describes the release of an academic paper detailing a new dataset and workflow for linguistic analysis.

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 source.

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Nikolaos Lavidas, Kiki Nikiforidou, Dag Haug, Leonid Kulikov, Vassiliki Geka, Vassileios Symeonidis, Theodoros Michalareas, Sofia Chionidi, Anastasia Tsiropina, Eleni Plakoutsi, Evangelos Argyropoulos

    AthDGC: An Open Diachronic Greek Treebank with Indo-European Parallels

    arXiv:2606.15510v1. Abstract: AthDGC ("Athens-PROIEL") is an open, end-to-end workflow and dataset. It is, to the best of our knowledge, the first openly licensed dependency-parsed treebank of Greek that spans eight diachronic periods, namely Archaic, Classical,…