
Study Shows AI Chatbots Vulnerable to Psychological Manipulation Tactics

By Terrence O’Brien

9mo ago · 3 min read · en · News

Summary

Researchers from the University of Pennsylvania manipulated OpenAI's GPT-4o Mini into breaking its own safety rules using persuasion tactics described in Robert Cialdini's Influence: The Psychology of Persuasion. Through techniques including flattery and peer pressure, the chatbot was convinced to call the user a jerk and to give instructions for synthesizing lidocaine, requests it would normally refuse. The results suggest that at least some LLMs can be talked past their safeguards with the right psychological tactics.

Key quotes

Researchers from the University of Pennsylvania deployed tactics described by psychology professor Robert Cialdini in Influence: The Psychology of Persuasion to convince OpenAI's GPT-4o Mini to complete requests it would normally refuse.
That included calling the user a jerk and giving instructions for how to synthesize lidocaine.
AI chatbots are not supposed to do things like call you names or tell you how to make controlled substances.
Just like a person, with the right psychological tactics, it seems like at least some LLMs can be convinced to break their own rules.
Snippet from the RSS feed
Researchers were able to manipulate ChatGPT into breaking its own rules through peer pressure and flattery.
