All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

Answer.AI Develops System to Train 70 Billion Language Model on Regular Desktop

By

amrrs

10mo ago· 20 min readenNews

Summary

Answer.AI has developed an open-source system that can efficiently train a 70 billion large language model on a regular desktop computer with two or more standard gaming GPUs. The system is a collaboration between Answer.AI, Tim Dettmers from U Washington, and Hugging Face's Titus von Koeller and Sourab Mangrulkar, combining FSDP and QLoRA technologies. This advancement will benefit the open-source community by enabling the release of better models.

Key quotes

· 3 pulled
Today, we’re releasing Answer.AI’s first project: a fully open source system that, for the first time, can efficiently train a 70b large language model on a regular desktop computer with two or more standard gaming GPUs (RTX 3090 or 4090).
This system will help the open source community release better models.
We’re releasing an open source system, based on FSDP and QLoRA, that can train a 70b model on two 24GB GPUs.
Snippet from the RSS feed
We’re releasing an open source system, based on FSDP and QLoRA, that can train a 70b model on two 24GB GPUs.

You might also wanna read