Persian
PulseAugur coverage of Persian: every cluster mentioning Persian across labs, papers, and developer communities, ranked by signal.
-
Fine-tuning OCR model for Persian language using dataset engineering and GPU tricks
This article details the process of fine-tuning a vision-language OCR model to support the Persian language. It highlights the importance of dataset engineering and full fine-tuning techniques, along with practical GPU …
-
New ASR techniques tackle phonetic errors and judge reliability
Researchers are developing methods to improve automatic speech recognition (ASR) systems, particularly for low-resource languages, and to address specific error types. One approach, Error-Aware TF-IDF, uses …
-
LLMs struggle to translate proverbs into faithful narratives, study finds
Researchers have introduced a new task called "constrained semantic decompression" to evaluate how well large language models (LLMs) can transform abstract proverbs into detailed narratives. Using a dataset of Persian p…
-
New ParsVoice corpus boosts Persian TTS capabilities
Researchers have introduced ParsVoice, a substantial new corpus of Persian speech and text designed to advance text-to-speech (TTS) synthesis and other speech processing tasks. This dataset…
-
New dataset boosts Persian social media text classification
Researchers have introduced PerSoMed, a new large-scale dataset designed for classifying Persian social media text. The dataset contains 36,000 posts across nine categories, with each category having 4,000 samples to en…
-
ASR systems benchmarked on code-switching speech
A new benchmark study evaluated five commercial automatic speech recognition (ASR) systems on code-switching speech, specifically focusing on Arabic, Persian, and German mixed with English. The research introduced a nov…
-
Cross-language HTR models improve low-resource performance via sequence modeling
Researchers have investigated how cross-language transfer learning improves Handwritten Text Recognition (HTR) for low-resource Arabic-script languages. Their studies indicate that sequence modeling, rather than just sh…