All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

LLMs vs. Classical HPO Algorithms: Hybrid Approach Outperforms Both in Hyperparameter Optimization

By

[Submitted on 25 Mar 2026 (v1), last revised 17 Apr 2026 (this version, v5)]

1d ago· 2 min readenInsight

Summary

This research paper compares classical hyperparameter optimization (HPO) algorithms (CMA-ES, TPE) against LLM-based methods for tuning a small language model under a fixed compute budget. Classical methods consistently outperform pure LLM agents, especially in avoiding out-of-memory failures. Allowing LLMs to directly edit source code narrows the gap but doesn't close it, even with frontier models like Claude Opus 4.6 and Gemini 3.1 Pro Preview. The authors introduce Centaur, a hybrid approach that shares CMA-ES's interpretable internal state with an LLM, achieving the best results. A 0.8B parameter LLM in Centaur already outperforms all classical and pure LLM methods. The findings suggest LLMs are most effective as complements to classical optimizers, not replacements.

Key quotes

· 5 pulled
When defining a fixed search space over autoresearch, classical methods such as CMA-ES and TPE consistently outperform LLM-based agents, where avoiding out-of-memory failures matters more than search diversity.
We observe that LLMs struggle to track optimization state across trials. In contrast, classical methods lack the domain knowledge of LLMs.
To combine the strengths of both, we introduce Centaur, a hybrid that shares CMA-ES's interpretable internal state, including mean vector, step-size, and covariance matrix, with an LLM.
Centaur achieves the best result in our experiments, and a 0.8B LLM already suffices to outperform all classical and pure LLM methods.
All in all, our results suggest that LLMs are most effective as a complement to classical optimizers, not as a replacement.
Snippet from the RSS feed
The autoresearch repository enables an LLM agent to optimize hyperparameters by editing training code directly. We use it as a testbed to compare classical HPO algorithms against LLM-based methods on tuning the hyperparameters of a small language model un

You might also wanna read