
Sarvam AI Open-Sources 30B and 105B Reasoning Models Trained in India

By logicchains · 2mo ago · 48 min read

Summary

Sarvam AI is open-sourcing two large language models, Sarvam 30B and Sarvam 105B. Both are reasoning models trained entirely in India on compute provided under the IndiaAI mission. They were developed as a full-stack effort covering in-house datasets, tokenization, model architecture, execution kernels, scheduling, and inference systems. Both are Mixture of Experts (MoE) models, are reported to perform strongly on Indian-language benchmarks, and have their weights available on AI Kosh and Hugging Face.

Key quotes

We're releasing Sarvam 30B and Sarvam 105B as open-source models.
Both are reasoning models trained from scratch on large-scale, high-quality datasets curated in-house across every stage of training: pre-training, supervised fine-tuning, and reinforcement learning.
Training was conducted entirely in India on compute provided under the IndiaAI mission.
These models represent a true full-stack effort.
Beyond datasets, we optimized tokenization, model architecture, execution kernels, scheduling, and inference systems to make deployment efficient across a wide range of hardware.
Snippet from the RSS feed
Sarvam open-sources 30B and 105B reasoning models trained in India—MoE LLMs with leading Indian-language benchmarks; weights on AI Kosh and Hugging Face.
