All Topics
All Topics
Technology
Technology
AI
AI
Business
Business
Entertainment
Entertainment
News
News
Programming
Programming
Security
Security
Science
Science
Design
Design
Environment
Environment
Finance
Finance
Crypto
Crypto
Politics
Politics
Sports
Sports
Education
Education
Gaming
Gaming
Art
Art
Music
Music
Health
Health
Books
Books
Food
Food
Travel
Travel
Personal
Personal
Bluesky
Twitter

Model routing cuts AI costs by matching tasks to the right model, challenging OpenAI and Anthropic's premium pricing model

By

Jasmine Wu, Deirdre Bosa

18d ago· 3 min readenNews

Summary

Model routing is an emerging practice where companies match AI tasks to appropriately sized models—sending complex problems to expensive frontier models and routine work to cheaper, faster alternatives. This approach can yield 5-10x cost efficiency gains for boilerplate work. Currently, most enterprise AI usage (roughly 95%) still runs on the most expensive models, indicating significant untapped cost-saving potential.

Source

bskyModel routing cuts AI costs by matching tasks to the right model, challenging OpenAI and Anthropic's premium pricing modelcnbc.com

Key quotes

· 3 pulled
Scott Wu, CEO of Cognition, which makes the coding agent Devin, said the gains on routine work are enormous.
For a lot of the boilerplate work, he said, companies can get five to 10 times better cost efficiency using models that are still good enough for the task.
Glean CEO Arvind Jain has estimated that roughly 95% of enterprise AI usage is still running on the most expensive models.
Snippet from the RSS feed
Companies are shifting from running everything on the most powerful AI model to matching each task to the right one, a practice called model routing.

You might also wanna read

Comments

Sign in to join the conversation.

No comments yet. Be the first.