Dual RTX 5080 + RTX 3090 Setup Achieves 80+ Tok/s on Qwen 3.6 27B Q8 with 39GB VRAM
By
iMil
Pulled from the oven just right. Trustworthy, fact-dense, deeply satisfying.
Summary
A user describes building a dual GPU setup combining an RTX 5080 (16GB) with a refurbished RTX 3090 (24GB) to achieve 39GB total VRAM, enabling them to run Qwen 3.6 27B at Q8 quantization at 80+ tokens per second. The article details the hardware journey from a single 5080 to a dual-card configuration for local LLM experimentation.
Key quotes
· 3 pulledA year ago, I bought an RTX 5080 for both gaming and AI experiments.
Little did I know back then that I would be giving into the joys of local LLM setups.
So I began digging what kind of setup could take profit of those 2 cards together.
You might also wanna read
Guide to Calculating GPU Memory for Self-Hosted LLM Inference
The article provides a guide on calculating GPU memory requirements and managing concurrent requests for self-hosted large language model (L
APEX4: Platform-Dependent W4A4 LLM Inference via Intra-SM Compute Rebalancing
This paper presents APEX4, a system for efficient W4A4 (4-bit weights, 4-bit activations) LLM inference that addresses the bottleneck of gro
XMG Returns to Large-Format Laptops with PRO 18 Featuring RTX 5070 Ti and 12 GB GDDR7 at Computex 2026
At Computex 2026, XMG announced the PRO 18 and PRO 18 Value Edition (VE) notebooks, marking the company's return to the large-format laptop
Microsoft's Surface Laptop Ultra to feature Nvidia RTX Spark chip with up to 128GB unified memory
Microsoft is developing the Surface Laptop Ultra, a high-end RTX Spark system powered by Nvidia's new Arm-based chip for Windows PCs. It wil
arstechnica.com·11d agoLeaker Claims Nvidia RTX 50 Super GPUs Still on Track for 2025 Launch with 50% More VRAM
An X leaker (@Zed_Wang) claims Nvidia's RTX 50 Super series GPUs are back on track for a later 2025 launch, after previous reports of delays
Microsoft Surface Laptop Ultra: High-end RTX Spark workstation with 128GB unified memory coming later this year
Microsoft is developing the Surface Laptop Ultra, a high-end RTX Spark system powered by Nvidia's new Arm-based chip for Windows PCs. It wil
ift.tt·11d ago