PulseAugur

New methodology digitizes Arabic-English dictionary for computational use

This paper details a method for digitizing and encoding the Al-Mawrid Arabic-English dictionary using the ISO Lexical Markup Framework and TEI Lex-0 guidelines. The research addresses inconsistencies in legacy dictionaries and aims to create a standardized, machine-tractable computational lexicon. The methodology achieved a structural parsing accuracy of 91% and demonstrated high precision and recall for extracting synonyms and other morpho-semantic features.
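To make the target format concrete, here is a minimal sketch of what a single entry might look like when serialized along TEI Lex-0 lines. It is not the authors' pipeline: the helper `build_entry`, the identifier `kitab`, and the sample lemma are hypothetical and are not taken from Al-Mawrid. The element names (`entry`, `form`, `orth`, `gramGrp`, `gram`, `sense`, `cit`, `quote`) follow common TEI Lex-0 usage, but the exact attribute choices in the paper may differ.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def tei(tag):
    """Return a tag name qualified with the TEI namespace."""
    return f"{{{TEI_NS}}}{tag}"


def build_entry(entry_id, lemma, pos, translations):
    """Build a TEI Lex-0 style <entry> (hypothetical helper, illustrative only)."""
    # Serialize TEI elements without a prefix by making TEI the default namespace.
    ET.register_namespace("", TEI_NS)
    entry = ET.Element(
        tei("entry"),
        {f"{{{XML_NS}}}id": entry_id, f"{{{XML_NS}}}lang": "ar"},
    )

    # Headword: the lemma form with its orthographic representation.
    form = ET.SubElement(entry, tei("form"), {"type": "lemma"})
    ET.SubElement(form, tei("orth")).text = lemma

    # Grammatical information: part of speech as a typed <gram>.
    gram_grp = ET.SubElement(entry, tei("gramGrp"))
    ET.SubElement(gram_grp, tei("gram"), {"type": "pos"}).text = pos

    # One sense holding the English translation equivalents.
    sense = ET.SubElement(entry, tei("sense"), {f"{{{XML_NS}}}id": f"{entry_id}.s1"})
    for translation in translations:
        cit = ET.SubElement(
            sense,
            tei("cit"),
            {"type": "translationEquivalent", f"{{{XML_NS}}}lang": "en"},
        )
        ET.SubElement(cit, tei("quote")).text = translation
    return entry


# Assumed sample data, chosen only for illustration.
entry = build_entry("kitab", "كتاب", "noun", ["book"])
xml_text = ET.tostring(entry, encoding="unicode")
print(xml_text)
```

Keeping the headword, grammatical information, and translation equivalents in separate typed elements is what makes such an entry machine-tractable: each field can be queried directly instead of being recovered from print typography.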

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

  1. arXiv cs.CL · English · Diaa Fayed, Laurent Romary

    Analyzing and Encoding the Al-Mawrid Arabic-English Dictionary with the ISO Language Markup Framework and TEI Lex-0

    arXiv:2606.18205v1. Abstract: This paper presents a robust methodology for the systematic digitization and encoding of the Al-Mawrid Arabic-English dictionary, transforming it from a legacy print resource into a standardized computational lexicon. Addressing a s…

  2. arXiv cs.CL · English · Laurent Romary

    Analyzing and Encoding the Al-Mawrid Arabic-English Dictionary with the ISO Language Markup Framework and TEI Lex-0

    This paper presents a robust methodology for the systematic digitization and encoding of the Al-Mawrid Arabic-English dictionary, transforming it from a legacy print resource into a standardized computational lexicon. Addressing a significant gap in Arabic lexical infrastructure,…