Researchers have developed a novel framework for translating government documents from Marathi to English, specifically addressing the challenge of preserving document structure and formatting. This system integrates layout-aware OCR, coordinate-based text extraction, and large language models to ensure that the translated documents maintain their original layout and hierarchical elements. Evaluations on real-world Marathi government PDFs show that this approach significantly improves structural preservation, translation coherence, and terminological consistency compared to standard text-only translation methods, aiming to enhance multilingual accessibility in e-governance. AI
IMPACT Enhances accessibility of government documents across languages, potentially streamlining administrative processes and policy analysis.
RANK_REASON Academic paper detailing a novel technical approach to document translation. [lever_c_demoted from research: ic=1 ai=1.0]
- alphaXiv
- arXiv
- CatalyzeX
- Connected Papers
- DagsHub
- E-Governance
- Gotit.pub
- HTML
- Hugging Face
- India
- Litmaps
- LLM
- Marathi
- ScienceCast
- scite Smart Citations
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →