Show HN: Local LLM Notepad – run a GPT-style model from a USB stick
By
davidye324
What it is A single 45 MB Windows .exe that embeds llama.cpp and a minimal Tk UI. Copy it (plus any .gguf model) to a flash drive, double-click on any Windows PC, and you’re chatting with an LLM—no admin rights, Cloud, or network.
Why I built it Existing “local LLM” GUIs assume you can pip install, pass long CLI flags, or download GBs of extras.
I wanted something my less-technical colleagues could run during a client visit by literally plugging in a USB drive.
How it works PyInstaller one-file build → bundles Python runtime, llama_cpp_python, and the UI into a single PE.
On first launch, it memory-maps the .gguf; subsequent prompts stream at ~20 tok/s on an i7-10750H with gemma-3-1b-it-Q4_K_M.gguf (0.8 GB).
Tick-driven render loop keeps the UI responsive while llama.cpp crunches.
A parser bold-underlines every token that originated in the prompt; Ctrl+click pops a “source viewer” to trace facts. (Helps spot hallucinations fast.)
Comments URL: https://news.ycombinator.com/item?id=44429116
Points: 8
# Comments: 1
You might also wanna read
From Bottleneck to Empowerer: Applying the Leader-Leader Model in Engineering Management
The article discusses the common challenge engineers face when transitioning from individual contributor roles to engineering leadership pos
Sergey Brin tells Google AI staff 60-hour workweek is the 'sweet spot' for productivity
Sergey Brin, Google cofounder, wrote an internal memo to employees working on the Gemini AI products recommending a 60-hour workweek as the
Solar desalination system eliminates toxic brine while producing fresh water
Scientists have developed a solar-powered desalination system that converts seawater into fresh water without producing toxic brine, a major
A design engineer's crisis: Quitting tech amid AI disillusionment
A design engineer reflects on quitting their stable, well-paying tech job amid growing disillusionment with the industry's direction, partic
Why being skeptical of AI is a rational and necessary choice
This opinion piece argues that skepticism and criticism of AI is a healthy, rational response rather than a sign of ignorance. It pushes bac
Chuwi Minibook X Review: A Modern Netbook Revival with Solid Specs
The Chuwi Minibook X is a modern take on the classic netbook — a 10.5-inch x86_64 sub-ultrabook with 16GB RAM and a 512GB NVMe drive. While
