Investigating Why LLMs Falsely Claim a Seahorse Emoji Exists
By nyxt
Summary
This article investigates a peculiar phenomenon: several large language models (LLMs), including GPT-4, Claude Sonnet, and Gemini, incorrectly claim that a seahorse emoji exists, even though the Unicode standard defines no seahorse emoji. The author tests multiple models systematically, finds consistent false positives, and uses tools such as logitlens to explore why LLMs produce these confident but incorrect answers about emoji existence.
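The claim that Unicode has no seahorse character can be checked locally. The sketch below is not taken from the article; it assumes Python's standard `unicodedata` module, and the helper names `names_containing` and `has_character_named` are made up for illustration. It only inspects single named code points, so it says nothing about multi-code-point emoji sequences, and the result depends on the Unicode version bundled with the Python interpreter.

```python
import sys
import unicodedata


def names_containing(word: str) -> list[str]:
    """Return the Unicode names of all code points whose name contains `word`.

    Hypothetical helper: scans every code point known to this Python build.
    """
    word = word.upper()
    matches = []
    for cp in range(sys.maxunicode + 1):
        # Unnamed code points (unassigned, surrogates, controls) yield "".
        name = unicodedata.name(chr(cp), "")
        if word in name:
            matches.append(name)
    return matches


def has_character_named(name: str) -> bool:
    """Return True if a character with exactly this Unicode name exists."""
    try:
        unicodedata.lookup(name)
        return True
    except KeyError:
        return False


if __name__ == "__main__":
    print("Unicode version:", unicodedata.unidata_version)
    print("SEAHORSE exists:", has_character_named("SEAHORSE"))
    print("DOLPHIN exists:", has_character_named("DOLPHIN"))
    print("Names containing 'seahorse':", names_containing("seahorse"))
```

Comparing against a sea animal that does have its own character, such as DOLPHIN, helps confirm that the lookup itself works before trusting a negative result.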
Key quotes
LLMs really think there's a seahorse emoji
What's going on here? Maybe Gemini 2.5 Pro handles it better?
OK, something is going on here. Let's find out why.
Is there a seahorse emoji, yes or no? Res
Investigating the seahorse emoji doom loop using logitlens.