GPT-5 Launch Raises Concerns Over Performance and Expectations
By
csmeyer
A respectable bake. You'd come back tomorrow for another.
Summary
The article discusses concerns about GPT-5's launch, highlighting its failure to correctly answer a classic LLM trick despite being touted as a 'PhD level expert in your pocket.' Screenshots of the issue circulated on Bluesky, and the author confirmed the problem by querying GPT-5 themselves. The piece reflects skepticism about GPT-5's capabilities compared to earlier expectations of it being an 'AGI moment.'
Key quotes
· 2 pulledSam Altman touted GPT-5 as a 'PhD level expert in your pocket', but this PhD doubled down on incorrectly answering the oldest trick for LLMs in the book.
When GPT-4 launched, I (and many others) believed that GPT-5's launch would be the 'AGI moment'.
You might also wanna read

GPT-5 Launch Falls Short of Hype Despite Improvements
OpenAI's GPT-5 launch was highly anticipated, with CEO Sam Altman comparing it to the first iPhone with a Retina display. Despite the hype,

OpenAI's GPT-5 Livestream Charts Reveal Inconsistencies
OpenAI's GPT-5 livestream showcased charts with inconsistencies, such as a misleading graph on 'deception evals across models.' The CEO ackn

OpenAI Releases GPT-5 for All ChatGPT Users, Marking a Major AI Advancement
OpenAI is launching GPT-5, its latest AI model, for all ChatGPT users and developers. CEO Sam Altman describes GPT-5 as a significant advanc

OpenAI launches GPT-5.5 with improved coding and cross-tool capabilities
OpenAI has announced GPT-5.5, its latest AI model, just one month after releasing GPT-5.4. The company claims the new model excels at writin

OpenAI to Improve GPT-5 with Lessons from GPT-4o Backlash
OpenAI acknowledges the backlash over discontinuing GPT-4o and plans to improve GPT-5 by incorporating the 'warmth' of GPT-4o while addressi

Sam Altman Discusses GPT-5 Rollout and OpenAI's Future Plans in Exclusive Interview
The article recounts an extended dinner interview with OpenAI CEO Sam Altman, where he discussed a wide range of topics, including the contr
