GabrielBianconi

3 articles on Hacker News: Front Page

Appears on

Hacker News

Hacker News: Front Page

Articles (3)

Technical Implementation of DeepSeek LLM Deployment with Expert Parallelism on 96 H100 GPUs

lmsys.org (9mo ago)

Fine-Tuned Small LLMs Outperform Larger Models at 5-30x Lower Cost with Data Curation

tensorzero.com (10mo ago)

Supervised Fine-Tuning as Reinforcement Learning: Introducing Importance-Weighted SFT

arxiv.org (10mo ago)