Dual RTX 5080 + RTX 3090 Setup Achieves 80+ Tok/s on Qwen 3.6 27B Q8 with 39GB VRAM

iMil

9h ago · 4 min read · en

Score: 80 · Type: how-to · Sentiment: positive

Summary

A user describes building a dual-GPU setup that pairs an RTX 5080 (16GB) with a refurbished RTX 3090 (24GB). The author reports 39GB of total VRAM across the two cards (nominally 16GB + 24GB = 40GB), enough to run Qwen 3.6 27B at Q8 quantization at 80+ tokens per second. The article traces the hardware journey from a single 5080 to a dual-card configuration for local LLM experimentation.
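A rough back-of-the-envelope check shows why the second card matters. The sketch below is not from the article: it assumes a flat 8 bits per weight for Q8 (real Q8 formats add a small per-block scale overhead), ignores KV cache and runtime buffers unless passed in through the hypothetical `overhead_gb` parameter, and treats GB as 10^9 bytes. The helper names are illustrative only.

```python
def weight_footprint_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (10^9 bytes).

    Assumption: every parameter costs exactly `bits_per_weight` bits.
    """
    return params_billion * bits_per_weight / 8.0


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float, overhead_gb: float = 0.0) -> bool:
    """True if the weights plus a caller-supplied overhead fit in `vram_gb`.

    `overhead_gb` is a hypothetical allowance for KV cache and buffers.
    """
    needed = weight_footprint_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= vram_gb


# Figures taken from the summary: 27B parameters, Q8, 16GB / 24GB / 39GB.
weights_gb = weight_footprint_gb(27, 8)
print(f"Estimated Q8 weights: {weights_gb:.1f} GB")

for name, vram_gb in [("RTX 5080", 16), ("RTX 3090", 24), ("both cards", 39)]:
    verdict = "fits" if fits(27, 8, vram_gb) else "does not fit"
    print(f"{name} ({vram_gb} GB): {verdict}")
```

Under these assumptions the weights alone come to about 27 GB, more than either card holds by itself but within the combined pool, which is consistent with the author's motivation for running both cards together.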

Key quotes

A year ago, I bought an RTX 5080 for both gaming and AI experiments.

Little did I know back then that I would be giving into the joys of local LLM setups.

So I began digging what kind of setup could take profit of those 2 cards together.

Snippet from the RSS feed

Dual GPU setup: run Qwen 3.6 27B at a Q8 quantization at 80+ tokens/sec with 39GB total VRAM
