All Topics
All Topics
Technology
Technology
Design
Design
Programming
Programming
Science
Science
News
News
Gaming
Gaming
Entertainment
Entertainment
Business
Business
Finance
Finance
Sports
Sports
Health
Health
Food
Food
Travel
Travel
Art
Art
Music
Music
Books
Books
Education
Education
Politics
Politics
Personal
Personal
No algorithm. No AI slop. No ads. Just RSS. Pro-human. Indie writers. Real journalism. Open web. Chronological. Hand toasted.

ATLAS: Adaptive Test-time Learning System Achieves 74.6% Code Benchmark Performance with Frozen 14B Model

By

yogthos

2mo ago· 9 min readenCode

Summary

ATLAS (Adaptive Test-time Learning and Autonomous Specialization) is a system that wraps a frozen smaller language model (14B parameters) with intelligent infrastructure to achieve 74.6% LiveCodeBench pass@1-v(k=3) performance on a single consumer GPU, up from 36-41% in previous versions. The approach uses constraint-driven generation, energy-based verification, and self-verified iterative refinement without fine-tuning, API calls, or cloud dependencies. The system is fully self-hosted, ensuring no data leaves the machine, and aims to compete with frontier API models at a fraction of the cost.

Key quotes

· 3 pulled
A.T.L.A.S achieves 74.6% LiveCodeBench pass@1-v(k=3) with a frozen 14B model on a single consumer GPU -- up from 36-41% in V2
The premise: wrap a frozen smaller model in intelligent infrastructure -- structured generation, energy-based verification, self-verified repair -- and it can compete with frontier API models at a fraction of the cost
No fine-tuning, no API calls, no cloud. Fully self-hosted -- no data leaves the machine
Snippet from the RSS feed
Adaptive Test-time Learning and Autonomous Specialization - itigges22/ATLAS

You might also wanna read