We trained a ~frontier Deep Research Agent on academic budget: 32 H100s

1d ago

Source

Twitter / X: osu-nlp-group.github.io
Snippet from the RSS feed
We trained a ~frontier Deep Research Agent on an academic budget:
> 32 H100s
> 8K synthetic samples
> fully open training infra + recipe (SFT, mid-training, RL)
> models of different sizes (2B -> 35B) ready to use out of the box

This is yet another demonstration of how the frontier of AI is changing. We have reached a point where open models + a small capable team + a few hundred Ks can produce specialized models with ~frontier capabilities. The future of AI doesn’t have to be held in a chokehold by a handful of closed models.

We've open-sourced everything we've built and learned from this project. Hope it helps the community build more!

📌 Project:
📌 Paper:
📌 Code:
📌 Model Weights and Data:
📌 Demo:

Amazing effort led by @jianxie_ (our 1st year student!!), Tianhe Lin, and Zilu Wang. Joint with @hhsun1 and the @osunlp team. Thanks to @amazon and Xiangjun Wang for a gift that covers the compute and for fruitful discussion.
