English(EN) RoTRAG: Rule of Thumb Reasoning for Conversation Harm Detection with Retrieval-Augmented Generation

新AI框架RoTRAG增强对话中有害内容的检测

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-16 04:00

研究人员开发了RoTRAG，一个旨在增强多轮对话中有害内容检测能力的新型框架。该系统通过整合人工制定的道德规范（称为经验法则，RoTs）来增强检索增强生成，为推理提供明确的规范性证据。RoTRAG还包含一个轻量级分类器，可高效判断何时需要基于检索的推理，从而减少冗余计算。在基准数据集上的实验表明，与现有方法相比，在危害分类和严重性估计方面有了显著改进。 AI

影响该框架有望为内容审核和安全领域带来更可靠、更具可解释性的AI系统。

排序理由该集群描述了一篇在arXiv上发表的研究论文，详细介绍了一个新AI框架。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Juhyeon Lee, Wonduk Seo, Junseo Koh, Seunghyun Lee, Haihua Chen, Yi Bu · 2026-06-16 04:00

RoTRAG: Rule of Thumb Reasoning for Conversation Harm Detection with Retrieval-Augmented Generation

arXiv:2604.17301v2 Announce Type: replace-cross Abstract: Detecting harmful content in multi turn dialogue requires reasoning over the full conversational context rather than isolated utterances. However, most existing methods rely mainly on models internal parametric knowledge, …

报道来源 [1]

RoTRAG: Rule of Thumb Reasoning for Conversation Harm Detection with Retrieval-Augmented Generation

相关实体

相关话题