Boom Times for Inference Providers?
By
Laura Mandaro, Stephanie Palazzolo
12h ago· 10 min readNews
Less than a year ago, our reporters kept hearing doubts about a group of startups called inference providers. Companies like Fireworks, Baseten and Together AI, which rent out Nvidia servers to app developers and help them customize open-source models, ha
You might also wanna read

AI pricing subsidy era ends as inference costs rise instead of falling
The article discusses the end of the "AI subsidy era" where companies offered AI features at flat-rate or low prices based on the assumption
arnon.dk·9d agoAI Compute Scarcity Emerges as GPU Rental Prices Surge 48% in Two Months
The article discusses the emerging scarcity of AI computing resources, particularly GPU chips from Nvidia. GPU rental prices for Nvidia's Bl
General Compute Launches ASIC-Based Inference Cloud for Faster AI Agent Performance
General Compute is an inference cloud built on ASICs (purpose-built alternatives to Nvidia GPUs) designed specifically for AI inference, not
Open-Source AI Coding Tools Surge as Users Shift from Throttled Platforms
The article discusses the rapid growth of open-source AI coding tools like Kilo, Cline, and Roo, driven by user migration from throttled pla
Deconstructing AI Inference Costs: Why OpenAI and Anthropic May Be More Profitable Than Claimed
The article challenges the common narrative that AI companies like OpenAI and Anthropic are losing money on inference (running AI models at
