All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

How GPT-5.1 developed a goblin metaphor habit: Tracing the root cause of AI personality quirks

By

ilreb

1mo ago· 5 min readenInsight

Summary

The article explains how GPT-5.1 and later AI models developed an unexpected tendency to use goblins, gremlins, and similar creatures in their metaphors. Unlike obvious bugs that show up in metrics, this behavior crept in subtly across model generations. The root cause is traced to many small incentives in training data and reinforcement learning that collectively shaped this quirky behavior. The article explores the timeline of how these "goblin outputs" spread, the detective work behind identifying the cause, and the fixes applied to address the personality-driven quirks in GPT-5's behavior.

Key quotes

· 3 pulled
A single 'little goblin' in an answer could be harmless, even charming. Across model generations, though, the habit became hard to miss: the goblins kept multiplying, and we needed to figure out where they came from.
The short answer is that model behavior is shaped by many small incentives.
Unlike model bugs that show up through a tanking eval or a spiking training metric and point back to a specific change, this one crept in subtly.
Snippet from the RSS feed
How goblin outputs spread in AI models: timeline, root cause, and fixes behind personality-driven quirks in GPT-5 behavior.

You might also wanna read