All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

whichllm: An open-source CLI tool that ranks local LLMs by real benchmarks and hardware compatibility

By

andyyyy64

16d ago· 9 min readenCode

Summary

whichllm is an open-source CLI tool that auto-detects your GPU/CPU/RAM specs and ranks the best local LLMs from HuggingFace that actually run on your hardware. Unlike simple "what fits" tools, it ranks models by real, recency-aware benchmark performance rather than just parameter count or VRAM size. The tool emphasizes that a smaller, newer model can outperform a larger older one (e.g., ranking a 27B model above a 32B one due to better benchmarks). It provides one-command instant results with live HuggingFace data.

Key quotes

· 4 pulled
The 32B model fits your card fine — whichllm still ranks the 27B #1, because it scores higher on real benchmarks and is a newer generation.
A size-only 'what fits?' tool would hand you the bigger one. That gap is the whole point of whichllm.
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count.
Auto-detects your GPU/CPU/RAM and ranks the top models from HuggingFace that fit your system.
Snippet from the RSS feed
Find the local LLM that actually runs and performs best on your hardware. Ranked by real, recency-aware benchmarks, not parameter count. One command, run it instantly. - Andyyyy64/whichllm

You might also wanna read