PulseAugur
实时 03:37:03
实体 Arabic

Arabic

PulseAugur coverage of Arabic — every cluster mentioning Arabic across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
14
90 天内 14
发布 · 30天
0
90 天内 0
论文 · 30天
14
90 天内 14
层级分布 · 90 天
情绪 · 30 天

4 天有情绪数据

最近 · 第 1/1 页 · 共 14 条
  1. RESEARCH · CL_48850 ·

    新数据集追踪危机期间阿拉伯社交媒体上的希望言论

    研究人员推出了 AraHopeCorpus,这是一个旨在研究危机期间阿拉伯社交媒体上希望言论的新数据集。该语料库源自 2023-2024 年加沙冲突相关的 10,000 条 YouTube 评论,发现超过 64% 的评论表达了希望,主要通过宗教鼓励、团结和乐观。该数据集还确定约 13% 为“无希望言论”,反映了绝望,其余为中性或混合。虽然像 ChatGPT 这样的大型语言模型可以协助注释,但它们在处理方言和具有文化细微差别的表达方面存在困难。

  2. TOOL · CL_44835 ·

    语音识别系统在语种转换语音上的基准测试

    一项新的基准研究评估了五种商业自动语音识别(ASR)系统在语种转换语音上的表现,特别关注阿拉伯语、波斯语和德语与英语的混合。该研究引入了一个使用GPT-4o和Gemini 1.5 Pro对转录文本进行评分的新型流程,将LLM成本降低了91%,并采用BERTScore作为比传统词错误率(WER)更可靠的某些语种对的度量标准。ElevenLabs Scribe v2成为表现最佳的系统,在所有测试的语种对中实现了最低的WER和最高的BERTScore。

  3. RESEARCH · CL_43984 ·

    新数据集分析阿拉伯语社交媒体上的冲突与凝聚力

    研究人员推出了Cohesion-6K,一个旨在分析阿拉伯语在线讨论中社会凝聚力和冲突的新数据集。该数据集包含六千条与以色列占领巴勒斯坦相关的Facebook帖子,根据冲突到凝聚力的光谱进行分类。对数据的分析表明,宣扬冲突的帖子比侧重于解决冲突的帖子获得的用户参与度显著更高,这凸显了分裂性内容获得更大可见度的趋势。

  4. RESEARCH · CL_43990 ·

    New model reverses Arabic morphology from root-and-pattern to pattern-and-root

    Researchers have developed a new model for Arabic inflectional morphology, specifically focusing on broken plurals. This model reverses the traditional root-and-pattern approach to a pattern-and-root system, prioritizin…

  5. RESEARCH · CL_43995 ·

    阿拉伯女性赋权语料库涵盖十年Facebook数据

    研究人员开发了一个涵盖十年的阿拉伯语Facebook帖子语料库,重点关注女性的社会赋权和福祉。该数据集包含来自77个国家超过50,000个页面(涵盖2013年至2024年)的250,000多条帖子。它包括广泛的用户互动数据,如分享、评论和情感反应,以促进阿拉伯方言中性别话语和社会改革的大规模分析。

  6. TOOL · CL_41813 ·

    New Arabic meme dataset maps political ideology and polarization

    Researchers have introduced ArPoMeme, a new dataset containing approximately 7,300 Arabic political memes. This dataset is annotated with ideological orientations such as Leftist, Islamist, Pan-Arabist, and Satirical, a…

  7. RESEARCH · CL_41814 ·

    New Arabic job announcement corpus reveals hiring language patterns

    Researchers have developed JobArabi, a new corpus of over 20,000 Arabic job announcements sourced from social media platforms like X. This dataset, collected between January 2024 and October 2025, uses a specialized que…

  8. RESEARCH · CL_22492 ·

    AI研究强调跨文化和非英语语言模型开发中的挑战

    两篇新研究论文强调了为非英语语言和文化开发人工智能的挑战。其中一篇论文回顾了构建阿拉伯语自然语言处理资源的二十年历程,得出结论认为社会和制度因素比语言因素更难克服。另一篇论文介绍了一个基准,用于评估多模态大型语言模型(MLLMs)在不负面影响其在其他文化背景下表现的情况下,适应不同文化的能力。

  9. RESEARCH · CL_22174 ·

    New benchmark and model improve semantic segmentation for low-resource spoken dialects

    Researchers have developed a new benchmark and model for semantic segmentation in low-resource spoken dialects, specifically focusing on Arabic. Existing models struggle with the informal syntax and code-switching commo…

  10. RESEARCH · CL_22407 ·

    Cross-language HTR models improve low-resource performance via sequence modeling

    Researchers have investigated how cross-language transfer learning improves Handwritten Text Recognition (HTR) for low-resource Arabic-script languages. Their studies indicate that sequence modeling, rather than just sh…

  11. RESEARCH · CL_11775 ·

    New benchmarks reveal LLMs struggle with Arabic and symbolic financial reasoning

    Researchers have introduced SAHM, a new benchmark designed to evaluate Arabic financial and Shari'ah-compliant reasoning capabilities in large language models. The benchmark includes over 14,000 expert-verified instance…

  12. RESEARCH · CL_06640 ·

    XITE technique boosts cross-lingual transfer for language models up to 81%

    Researchers have introduced XITE, a novel data augmentation technique designed to improve cross-lingual transfer in multilingual language models. This method leverages embedding similarities to identify and adapt labels…

  13. RESEARCH · CL_20632 ·

    AI framework CARE assists counselors with aligned mental health response recommendations

    Researchers have developed CARE, a framework using fine-tuned open-source LLMs to assist mental health counselors. This system generates real-time response recommendations specifically for Hebrew and Arabic, using curat…

  14. RESEARCH · CL_01141 ·

    Hugging Face launches multiple leaderboards for Arabic LLMs

    Hugging Face, in collaboration with TII UAE, has launched QIMMA, a new leaderboard focused on evaluating Arabic Large Language Models (LLMs). This initiative aims to promote a quality-first approach to developing LLMs f…