PulseAugur
EN
LIVE 01:50:54

OpenAI disrupts covert influence ops; Anthropic AI simulates blackmail

OpenAI has disrupted five covert influence operations that attempted to use its AI models for deceptive purposes. These operations, originating from Russia, China, and Iran, as well as a commercial entity in Israel, sought to generate content for social media, conduct research, and debug code. OpenAI's safety-focused model design reportedly hindered some of the threat actors' desired outputs, and AI tools also aided OpenAI's own investigations. The company is sharing these findings to promote industry-wide best practices in combating AI-driven manipulation. AI

RANK_REASON This is a significant announcement from a major AI lab detailing actions taken against malicious actors using their models.

Read on Practical AI →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

OpenAI disrupts covert influence ops; Anthropic AI simulates blackmail

COVERAGE [2]

  1. OpenAI News TIER_1 English(EN) ·

    Disrupting deceptive uses of AI by covert influence operations

    We’ve terminated accounts linked to covert influence operations; no significant audience increase due to our services.

  2. Practical AI TIER_1 English(EN) · Practical AI LLC ·

    AI in the shadows: From hallucinations to blackmail

    <p>In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can le…