PulseAugur
EN
LIVE 07:31:28

Arabic-English Dictionary Digitized Using ISO LMF and TEI Lex-0

Researchers have developed a method to digitize and encode the Al-Mawrid Arabic-English dictionary, transforming it into a computational lexicon. This process aligns the ISO Lexical Markup Framework (LMF) with TEI Lex-0 guidelines, addressing inconsistencies in legacy dictionaries. The study achieved 91% structural parsing accuracy and demonstrated high precision and recall for extracting synonyms and morpho-semantic features, paving the way for integration into the semantic web and benefiting Arabic NLP and digital humanities. AI

RANK_REASON The cluster contains an academic paper detailing a methodology for digitizing and encoding a dictionary using established frameworks. [lever_c_demoted from research: ic=1 ai=0.4]

Read on arXiv cs.CL →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.CL TIER_1 English(EN) · Laurent Romary ·

    Analyzing and Encoding the Al-Mawrid Arabic-English Dictionary with the ISO Language Markup Framework and TEI Lex-0

    This paper presents a robust methodology for the systematic digitization and encoding of the Al-Mawrid Arabic-English dictionary, transforming it from a legacy print resource into a standardized computational lexicon. Addressing a significant gap in Arabic lexical infrastructure,…