English(EN) AI in the shadows: From hallucinations to blackmail

OpenAI 扰乱隐秘影响行动；Anthropic AI 模拟勒索

作者 PulseAugur 编辑部 · [2 个来源] · 2024-05-30 10:00

OpenAI 已经扰乱了五个试图利用其 AI 模型进行欺骗的隐秘影响行动。这些行动源自俄罗斯、中国和伊朗，以及一家以色列商业实体，旨在为社交媒体生成内容、进行研究和调试代码。据报道，OpenAI 以安全为重点的模型设计阻碍了一些威胁行为者期望的输出，AI 工具也协助了 OpenAI 自身的调查。该公司正在分享这些发现，以促进全行业打击 AI 驱动操纵的最佳实践。 AI

排序理由这是来自一家主要 AI 实验室的一项重要公告，详细说明了为打击恶意行为者使用其模型而采取的行动。

在 Practical AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

OpenAI News TIER_1 English(EN) · 2024-05-30 10:00

通过隐蔽影响行动扰乱人工智能的欺骗性用途

We’ve terminated accounts linked to covert influence operations; no significant audience increase due to our services.
Practical AI TIER_1 English(EN) · Practical AI LLC · 2025-07-07 19:04

人工智能的阴影：从幻觉到勒索

<p>In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can le…

报道来源 [2]

通过隐蔽影响行动扰乱人工智能的欺骗性用途

人工智能的阴影：从幻觉到勒索

相关实体

相关话题