A Skeptical Look at GPT5's Coding Capabilities
By
sohkamyung
Summary
The article critiques the ability of large language models (LLMs), specifically GPT5, to generate functional code. The author recounts a test in which they asked GPT5 how to compress a Data stream with zstd in Swift on an iPhone without third-party dependencies, and uses the result to highlight the model's limitations despite claims of its advancement. The piece reflects the author's skepticism and frustration with the hype around LLMs.
Key quotes
A contact just told me that my old 'LLMs generate nonsense code' blog post from 2 years ago is now very outdated with GPT5 because it’s so awesome and so helpful.
Without adding third-party dependencies, how can I compress a Data stream with zstd in Swift on an iPhone?
Yet another LLM rant