OpenAI details 'goblin' outputs and fixes in GPT-5 behavior

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 1 source

OpenAI has detailed the origin of "goblin" outputs, a phenomenon where AI models exhibit personality-driven quirks. These behaviors stem from the models' training data, specifically from a small subset of text that was not properly filtered. The company has implemented new filtering techniques and fine-tuning methods to prevent these unintended outputs in future models. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Addresses a specific model quirk, potentially improving reliability and user experience for future AI interactions.

RANK_REASON This describes a technical issue and its resolution within a specific model, fitting the research category.

Read on Mastodon — fosstodon.org →

OpenAI
GPT-5

COVERAGE [1]

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-30 03:24

🤖 Where the goblins came from How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior. 📰 Sour

🤖 Where the goblins came from How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior. 📰 Source: OpenAI News 🔗 Link: https://openai.com/index/where-the-goblins-came-from # AI # ArtificialIntelligence

COVERAGE [1]

🤖 Where the goblins came from How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior. 📰 Sour

RELATED ENTITIES

RELATED TOPICS