All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

History-LLMs Project: Developing Large Language Models for Historical Analysis with Ranke-4B Release

By

iamwil

5mo ago· 8 min readenCode

Summary

The article describes the History-LLMs project, which focuses on training large language models specifically for historical analysis. The project is developing a family of 4 billion parameter models called Ranke-4B, based on the Qwen3 architecture, trained from scratch on 80 billion tokens of historical data with specific knowledge cutoffs (1913, 1929, 1933, 1939, 1946). The project uses a curated dataset of 600 billion tokens of time-stamped text and acknowledges support from Lambda AI research credits. The content serves as an information hub for the project's goal of training the largest possible historical LLMs.

Key quotes

· 4 pulled
A family of 4 billion (B) parameter large language models (LLMs) based on the Qwen3 architecture trained from scratch on 80B tokens of historical data up to knowledge-cutoffs $\in {1913, 1929, 1933, 1939, 1946}$
Information hub for our project training the largest possible historical LLMs
We gratefully acknowledge research credits provided by Lambda AI
using a curated dataset of 600B tokens of time-stamped text
Snippet from the RSS feed
Information hub for our project training the largest possible historical LLMs. - DGoettlich/history-llms

You might also wanna read