English(EN) IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

IndicGuard：面向印度语言的新安全模型和数据集已发布

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-22 04:36

研究人员开发了 IndicGuard，这是一个新的多语言安全模型和数据集，旨在解决英语为中心的 LLM 安全机制在印度地区存在的局限性。该模型在 4B 参数的 Gemma-3-4B-IT 基础模型上进行了微调，利用了涵盖十种主要印度语言的大型、具有文化细微差别的数据集，以识别和缓解特定区域的危害和对抗性攻击。与 CultureGuard 等现有模型相比，IndicGuard 表现出更优越的性能，即使对于训练数据中未包含的低资源印度语言，也展现出增强的鲁棒性和泛化能力。 AI

影响增强了 LLM 在不同语言和文化背景下的安全性和对齐性，可能改善全球 LLM 的部署。

排序理由该集群描述了一篇介绍针对特定语言的 LLM 的新型安全模型和数据集的研究论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-22 04:36

IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

As Large Language Models (LLMs) achieve widespread integration across diverse linguistic landscapes, ensuring their safety and alignment with regional normative values remains a critical challenge. Current safety mechanisms are predominantly optimized for English-centric framewor…

报道来源 [1]

IndicGuard: A Multilingual Safety Guard Model and Dataset for Indic Languages

相关实体

相关话题