Z80-μLM: A 2-Bit Quantized Language Model for Vintage Z80 Processors

Z80-μLM is a 2-bit quantized language model small enough to run on an 8-bit Z80 processor. Train conversational models in Python, export them as CP/M .COM binaries, and chat with your vintage compu...

Read the full article

quesomaster90006mo ago5 min readenCode

technology artificial intelligence programming retrocomputing

You might also wanna read

Parametric Memory Law: A Quantitative Framework for Understanding LoRA Memory Capacity in LLMs

Large Language Models (LLMs) must continuously learn and update knowledge to remain effective in dynamic real-world environments. While Low-

arxiv.org·1mo ago

Running small language models locally for agentic coding: A practical evaluation on Apple Silicon

Notes from my Thoughtworks colleagues on AI-assisted software delivery

martinfowler.com·8d ago

Accelerating Large-Scale LLM Inference on AMD Instinct MI350X/MI355X with Eagle3 and AMD Quark

Large language model (LLM) inference is increasingly constrained by autoregressive decoding. Even when prefill is highly optimized, the deco

AMD·14d ago

Nemotron-Labs-3-Puzzle-75B-A9B: A Compressed Hybrid MoE LLM for Efficient Interactive Deployment

We present Nemotron-Labs-3-Puzzle-75B-A9B, a compressed variant of Nemotron-3-Super optimized for interactive deployment. We designed the mo

arxiv.org·9d ago

Nemotron-Labs-3-Puzzle-75B-A9B: A Compressed Hybrid MoE LLM for Efficient Interactive Deployment

We present Nemotron-Labs-3-Puzzle-75B-A9B, a compressed variant of Nemotron-3-Super optimized for interactive deployment. We designed the mo

arxiv.org·9d ago

RTP-LLM: Alibaba's High-Performance Inference Engine for Large Language Model Deployment

Large Language Models (LLMs) have revolutionized AI applications, but deploying them at scale presents significant challenges. We present RT

arxiv.org·1mo ago

MiniMax M2.5 Review: Open-Weight Agentic LLM — 229B MoE, 80.2% SWE-Bench, BFCL Leader, $1.15/M Output

MiniMax M2.5 (February 12, 2026) is the Chinese AI company's open-weight flagship: 229 billion total parameters, 10 billion active (Sparse M

chatforest.com·2mo ago

Comments

No comments yet. Be the first.