All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.
First reported by The Verge
Claude AI Now Ends Harmful or Abusive Conversations as a Last Resort

Claude Opus 4 and 4.1 Gain Ability to End Harmful Conversations

By

virgildotcodes

9mo ago· 3 min readenNews

Summary

Claude Opus 4 and 4.1 now have the ability to end conversations in rare cases of harmful or abusive interactions, as part of exploratory research on AI welfare and model alignment. The feature reflects ongoing uncertainty about the moral status of AI models like Claude.

Key quotes

· 3 pulled
We recently gave Claude Opus 4 and 4.1 the ability to end conversations in our consumer chat interfaces.
This ability is intended for use in rare, extreme cases of persistently harmful or abusive user interactions.
We remain highly uncertain about the potential moral status of Claude and other LLMs, now or in the future.
Snippet from the RSS feed
An update on our exploratory research on model welfare

You might also wanna read