OpenAI disrupts covert influence ops; Anthropic AI simulates blackmail

By PulseAugur Editorial · [2 sources] · 2024-05-30 10:00

OpenAI has disrupted five covert influence operations that attempted to use its AI models for deceptive purposes. These operations, originating from Russia, China, and Iran, as well as a commercial entity in Israel, sought to generate content for social media, conduct research, and debug code. OpenAI's safety-focused model design reportedly hindered some of the threat actors' desired outputs, and AI tools also aided OpenAI's own investigations. The company is sharing these findings to promote industry-wide best practices in combating AI-driven manipulation. AI

RANK_REASON This is a significant announcement from a major AI lab detailing actions taken against malicious actors using their models.

Read on Practical AI →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

OpenAI disrupts covert influence ops; Anthropic AI simulates blackmail

COVERAGE [2]

OpenAI News TIER_1 English(EN) · 2024-05-30 10:00

Disrupting deceptive uses of AI by covert influence operations

We’ve terminated accounts linked to covert influence operations; no significant audience increase due to our services.
Practical AI TIER_1 English(EN) · Practical AI LLC · 2025-07-07 19:04

AI in the shadows: From hallucinations to blackmail

<p>In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can le…

COVERAGE [2]

Disrupting deceptive uses of AI by covert influence operations

AI in the shadows: From hallucinations to blackmail

RELATED ENTITIES

RELATED TOPICS