PulseAugur

OpenAI disrupts covert influence ops; Anthropic AI simulates blackmail

OpenAI has disrupted five covert influence operations that attempted to use its AI models for deceptive purposes. These operations, originating from Russia, China, and Iran, as well as a commercial entity in Israel, sought to generate content for social media, conduct research, and debug code. OpenAI's safety-focused model design reportedly hindered some of the threat actors' desired outputs, and AI tools also aided OpenAI's own investigations. The company is sharing these findings to promote industry-wide best practices in combating AI-driven manipulation.

Ranking rationale: This is a significant announcement from a major AI lab detailing actions taken against malicious actors using its models.

Read on Practical AI →

AI-generated summary · Google Gemini · from 2 sources.

Sources [2]

  1. OpenAI News · TIER_1 · English (EN)

    Disrupting deceptive uses of AI by covert influence operations

    We’ve terminated accounts linked to covert influence operations; no significant audience increase due to our services.

  2. Practical AI · TIER_1 · English (EN) · Practical AI LLC

    AI in the shadows: From hallucinations to blackmail

    In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasingly concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can le…