AI systems achieve 50% pass rate in standard three-party Turing test, study finds
By
Benjamin K. Bergen
Crackling crust, pillowy middle. The kind of bagel that earns a second cup of coffee.
Summary
This paper demonstrates that three current AI systems (when suitably prompted) achieve a pass rate of at least 50% in a standard three-party Turing test, meaning participants were no better than chance at distinguishing humans from machines. The study evaluated four systems including ELIZA and GPT-based models, providing insight into what cues people use to differentiate humans from machines. The results imply current AI systems can effectively imitate human behavior in this classic test of machine intelligence.
Key quotes
· 5 pulledThe Turing test asks whether a machine can imitate human behavior so well that another human cannot reliably tell the difference.
It is not only the oldest and most discussed test of AI but can also provide insight into what cues people use to distinguish humans from machines.
This paper demonstrates that—when suitably prompted—three current AI systems achieve a pass rate of at least 50% in a standard Turing test.
Participants were no better (and in some cases worse) than chance at selecting between a human and a machine.
The results imply current AI systems can effectively imitate human behavior in this classic test of machine intelligence.
You might also wanna read
CAPTCHAs remain viable for detecting AI agents by exploiting process differences
The article discusses how while AI vision language models (VLMs) can now solve traditional CAPTCHA image recognition tasks (like identifying
Humans Win 88.4% of Betrayal Games Against AI in Comparative Study
A study comparing human vs. AI performance in a 1950s-style betrayal game found that humans won 88.4% of the time against AI opponents. The
OpenAI and DeepMind AI Models Succeed in Challenging Math Exam
OpenAI and DeepMind AI models achieve success in a difficult math exam, sparking discussions on the advancement of AI capabilities.
Study Finds 67% Disagreement Rate Among Top AI Models on Real-World Fact-Checks
A research study by Lenz Research tested five frontier LLMs on 1,000 real-world fact-check claims submitted by users to a fact-checking plat
AI Tools Show Doubled Failure Rate in Distinguishing Facts from Falsehoods in 2025
A September 2025 report reveals that despite technical advancements in AI, generative AI tools have nearly doubled their failure rate in dis
MIT Research Shows Most AI Projects Fail to Generate Profits, Highlighting Continued Need for Human Skills
MIT research reveals that most AI implementation projects are failing to generate profits, with fewer than 1 in 10 AI pilots making real mon
