All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

New Variant of DeepSeek AI Model Developed by German Lab TNG Technology Consulting GmbH

By

saubeidl

11mo ago· 8 min readenNews

Summary

A new variant of the DeepSeek AI model, R1-0528, developed by German lab TNG Technology Consulting GmbH, is 200% faster than its predecessor. This improvement is attributed to TNG's Assembly-of-Experts method for building LLMs.

Key quotes

· 2 pulled
Like its predecessor, DeepSeek-R1 — which rocked the AI and global business communities with how cheaply it was trained and how well it performed on reasoning tasks, all available to developers and enterprises for free — R1-0528 is already being adapted and remixed by other A
This gain is made possible by TNG’s Assembly-of-Experts (AoE) method — a technique for building LLMs by selectively merging the weight tensors
Snippet from the RSS feed
This gain is made possible by TNG’s Assembly-of-Experts (AoE) method — a technique for building LLMs by selectively merging the weight tensors

You might also wanna read