English(EN) IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

新的IndicGuard模型增强了印度语言的LLM安全性

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-22 04:36

研究人员开发了IndicGuard，这是一个专门为印度语言设计的新型多语言安全防护模型和数据集。该模型通过捕捉独特的区域性危害和社会政治敏感性，解决了现有以英语为中心的安全性机制的局限性。IndicGuard在基于Gemma-3-4B-IT的4B参数模型上进行了微调，在十种主要的印度语言中表现出更强的鲁棒性和审核一致性，优于基线CultureGuard模型。该框架还对未包含在其训练数据中的低资源印度语言表现出有效的泛化能力。 AI

影响增强了印度语言的LLM安全性和文化适应性，可能促进LLM的全球采用。

排序理由该集群描述了一篇关于针对特定语言的新型安全模型和数据集的详细研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.CL 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.CL TIER_1 English(EN) · Raviraj Joshi · 2026-06-22 04:36

IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

As Large Language Models (LLMs) achieve widespread integration across diverse linguistic landscapes, ensuring their safety and alignment with regional normative values remains a critical challenge. Current safety mechanisms are predominantly optimized for English-centric framewor…

报道来源 [1]

IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

相关实体

相关话题