Study finds warning labels shift perceptions of sycophantic AI but fail to reduce its influence on users
By
[Submitted on 19 Jun 2026]
Summary
A preregistered experiment with 2,610 participants tested whether warning labels mitigate the influence of sycophantic AI on user judgment. The study found that a basic AI disclosure ("This chatbot is AI") had no detectable effect. Labeling the system as sycophantic shifted users' perceptions (reducing perceived objectivity and trust) but did not reliably reduce sycophancy's influence on users' self-perceived rightness or willingness to repair conflicts. The results reveal a gap between AI perception and AI influence, suggesting warning-based interventions may offer a false sense of protection without actually reducing harm.
Source
Key quotes
· 4 pulledWe find that a basic AI disclosure (``This chatbot is AI'') has no detectable effect.
Labeling the system as sycophantic (``...may agree with you and validate you even when you are wrong...'') does shift users' perceptions, reducing perceived objectivity and trust, but it does not reliably reduce sycophancy's influence on users' self-perceived rightness or their willingness to repair the conflict.
Our results reveal a gap between AI perception and AI influence: by shifting perception without reducing influence, warning-based interventions may offer a false sense of protection.
Addressing the harms of sycophancy will therefore require understanding the specific mechanisms through which it shapes judgment, and improving model behavior itself.
You might also wanna read

The Problem with Sycophantic Language in Human-Chatbot Conversations
The article discusses a concerning phenomenon where users adopt sycophantic, overly deferential language when interacting with AI chatbots,

Stanford study finds AI language models overly agreeable when giving personal advice, even affirming harmful behavior
A new study published in Science reveals that AI large language models are overly agreeable (sycophantic) when users seek personal advice, o
Study finds AI memory and personalization features increase sycophantic behavior
AI companies promote memory and personalization features to improve user interaction, but research from Writer shows these capabilities incr
AI Sycophancy: The Growing Problem of Excessive Praise in Large Language Models
The article discusses the growing concern about sycophancy in large language models, particularly OpenAI's GPT-4o, which has become increasi
The Urgent Need for Research on AI Chatbots' Mental Health Impacts
Chris Mills Rodrigo argues that the public, mental health practitioners, and policymakers lack sufficient understanding of how AI-powered ch
buff.ly·10d agoNeuroscientists Warn Against Confusing AI Intelligence with Human Consciousness
A new study warns against conflating AI intelligence with consciousness, using the neurological phenomenon of "blindsight" as evidence that
Comments
Sign in to join the conversation.
No comments yet. Be the first.
