All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Study Finds Frontier AI Models Disagree on Two-Thirds of Basic Fact-Check Claims

By

Jose Antonio Lanz

2d ago· 4 min readenNews

Summary

A new study by researcher Kosta Jordanov at Lenz Research tested five frontier AI models (GPT-5.4, Claude Opus 4.7, Gemini 3 Pro, Gemini 3 Pro with Search, and Sonar Pro) on 1,000 real-world fact-check claims. The models disagreed on 67% of claims, with at least one model breaking from the majority on 672 out of 1,000 claims. In 34% of cases, the disagreement was significant. The study highlights fundamental reliability issues with AI systems when it comes to basic factual verification.

Key quotes

· 3 pulled
Ask five of the world's most advanced AI systems whether a statement is true, and two-thirds of the time, at least one will give you a different answer.
On 672 out of 1,000 claims, at least one model broke from the majority.
In 34% of cases, the disagreement was significant.
Snippet from the RSS feed
A new study gave five frontier AI models 1,000 real-world claims to fact-check. They disagreed on 67% of them.

You might also wanna read