Study Finds 67% Disagreement Rate Among Top AI Models on Real-World Fact-Checks
A study by Lenz Research tested five frontier LLMs on 1,000 real-world claims that users submitted to a fact-checking platform. In 67% of cases, the top AI models disagreed on the verdict. Unlike benchmark tests with public answer keys, thes