PulseAugur

NVIDIA Releases Nemotron 3.5 for Multimodal AI Safety

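NVIDIA has released Nemotron 3.5 Content Safety, an AI model designed to identify and mitigate harmful content in text and images. The new version improves multimodal understanding, supports more than 140 languages with strong zero-shot generalization, and lets enterprises customize policy enforcement to their specific needs. It also includes an auditable reasoning trace, and its multimodal safety dataset has been released publicly.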

Impact: Strengthens enterprise AI safety through customizable multimodal content moderation and reasoning capabilities.

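Ranking rationale: NVIDIA released Nemotron 3.5, a new version of its content safety model with enhanced multimodal and multilingual capabilities, which qualifies as a model release.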

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 5 sources. How we write summaries →

Sources [5]

  1. Hugging Face Blog TIER_1 English(EN) ·

    Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprises

  2. arXiv cs.AI TIER_1 English(EN) · Yanjing Ren, Reza Ebrahimi, TengTeng Ma ·

    AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety

    arXiv:2606.04867v1. Abstract: As AI companion platforms such as Replika and Character.AI rapidly grow, concerns about unsafe human-AI interactions have intensified. This study introduces AICompanionBench, to our knowledge the first publicly available benchmark d…

  3. arXiv cs.AI TIER_1 English(EN) · TengTeng Ma ·

    AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety

    As AI companion platforms such as Replika and Character.AI rapidly grow, concerns about unsafe human-AI interactions have intensified. This study introduces AICompanionBench, to our knowledge the first publicly available benchmark dataset of human-AI companion conversations annot…

  4. LessWrong (AI tag) TIER_1 English(EN) · Austin Chen ·

    Sixteen Proposals for AI Safety

    These days, I often run across whippersnappers (https://generatorresidency.org/) excited to do something for AI safety — but aren’t quite sure what. One of the fun things about the Future Fund era wer…

  5. LessWrong (AI tag) TIER_1 English(EN) · MichaelDickens ·

    We Need Breadth-First AI Safety Plans

    Cross-posted from my website (https://mdickens.me/2026/06/01/breadth-first_AI_safety_plans/). Depth-first plans lay out a path from here to aligned superintelligent AI. We need those kinds of plans. But depth-first plans depend on m…