AI Compute Scarcity Emerges as GPU Rental Prices Surge 48% in Two Months
By
gmays
A bagel you'd recommend to a friend without hedging.
Summary
The article discusses the emerging scarcity of AI computing resources, particularly GPU chips from Nvidia. GPU rental prices for Nvidia's Blackwell chips have surged 48% in just two months, from $2.75 to $4.08 per hour. Companies like CoreWeave have raised prices by 20% and extended minimum contracts from one to three years. OpenAI's CFO Sarah Friar notes they're making "tough trades" on projects due to insufficient compute, while Anthropic has limited access to its newest model. This scarcity is reshaping the AI industry, forcing startups to compete on infrastructure access rather than iteration speed.
Key quotes
· 4 pulledGPU rental prices for Nvidia's Blackwell chips hit $4.08 per hour this week, up 48% from $2.75 just two months ago.
CoreWeave raised prices 20% & extended minimum contracts from one year to three.
"We're making some very tough trades at the moment on things we're not pursuing because we don't have enough compute." - Sarah Friar, OpenAI CFO
This scarcity is already reshaping access. Anthropic has limited its newest model to roughly
You might also wanna read
Companies seek cheaper AI alternatives as costs rise and ROI remains unclear
Corporations are increasingly seeking cheaper AI models as costs from major AI labs like Anthropic and OpenAI blow out IT budgets without cl
Major exchanges developing futures markets for AI tokens and GPU rentals
Major financial exchanges including China's Shanghai Futures Exchange, CME Group, and the Intercontinental Exchange (NYSE owner) are develop

Financial Risks in AI Data Center Boom: Nvidia's Neocloud Investments and GPU-Collateralized Debt
The article examines the financial risks in the AI data center boom, focusing on how Nvidia's investments have created a class of 'neocloud'
AI boom outpaces data center infrastructure, creating dangerous misalignment
The rapid expansion of AI is outpacing data center infrastructure development, particularly in the US where hyperscalers and cloud providers
General Compute Launches ASIC-Based Inference Cloud for Faster AI Agent Performance
General Compute is an inference cloud built on ASICs (purpose-built alternatives to Nvidia GPUs) designed specifically for AI inference, not
Unrestricted AI tool usage leads to massive cost overruns at major tech companies
Unrestricted access to advanced AI tools like Anthropic's Claude has led to massive unexpected costs for major companies. One unnamed enterp
