MAKER System Solves Million-Step LLM Task with Zero Errors Through Extreme Decomposition
By
Anon84
Summary
Researchers have developed MAKER, which they describe as the first system to solve a task requiring over one million LLM steps with zero errors. The work targets a persistent limitation of large language models: chaining their abilities into long, extended processes. The approach decomposes a task into very small subtasks, each handled by a focused microagent, and applies multi-agent voting for error correction at every step. The authors suggest that such massively decomposed agentic processes (MDAPs) may offer a way to solve problems at the level of organizations and societies, as an alternative to relying on continual improvement of individual LLMs.
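The decompose-then-vote idea in the summary can be illustrated with a small sketch. This is not MAKER's implementation: the summary does not specify the voting rule, so the code below assumes a plain strict-majority vote, and every name in it (`vote_on_step`, `run_decomposed`, the toy agents) is hypothetical. The "agents" are ordinary Python functions standing in for LLM calls.

```python
from collections import Counter
from typing import Callable, Sequence

# Hypothetical type: a microagent maps the current state to a proposed next state.
# In a real system this would wrap an LLM call; here it is a plain function.
Microagent = Callable[[int], int]


def vote_on_step(agents: Sequence[Microagent], state: int) -> int:
    """Ask every agent for the next state and keep the strict-majority answer.

    Assumption: simple majority voting. The source only says "multi-agent
    voting", so the actual scheme may differ.
    """
    proposals = [agent(state) for agent in agents]
    winner, count = Counter(proposals).most_common(1)[0]
    if count * 2 <= len(proposals):
        raise ValueError(f"no strict majority among proposals: {proposals}")
    return winner


def run_decomposed(agents: Sequence[Microagent], initial_state: int, steps: int) -> int:
    """Run a task as a chain of single-step subtasks, voting at each step."""
    state = initial_state
    for _ in range(steps):
        state = vote_on_step(agents, state)
    return state


# Toy task: count upward by one per step.
def reliable_agent(state: int) -> int:
    return state + 1


def faulty_agent(state: int) -> int:
    # Makes a mistake on every tenth state to simulate an occasional error.
    return state + 2 if state % 10 == 0 else state + 1


if __name__ == "__main__":
    final_state = run_decomposed([reliable_agent, reliable_agent, faulty_agent], 0, 1000)
    print(final_state)  # 1000: the faulty agent is outvoted at every step
```

Because each subtask is a single step, a wrong proposal is outvoted before it can propagate into later steps, which is the property the summary attributes to the modular design.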
Key quotes
LLMs have achieved remarkable breakthroughs in reasoning, insights, and tool use, but chaining these abilities into extended processes at the scale of those routinely executed by humans, organizations, and societies has remained out of reach.
This paper describes MAKER, the first system that successfully solves a task with over one million LLM steps with zero errors, and, in principle, scales far beyond this level.
The approach relies on an extreme decomposition of a task into subtasks, each of which can be tackled by focused microagents.
The high level of modularity resulting from the decomposition allows error correction to be applied at each step through an efficient multi-agent voting scheme.
Thus, the results suggest that instead of relying on continual improvement of current LLMs, massively decomposed agentic processes (MDAPs) may provide a way to efficiently solve problems at the level of organizations and societies.
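A rough back-of-the-envelope calculation shows why per-step error correction matters at this scale. The sketch below is illustrative only and rests on a simplifying assumption that is not taken from the paper: every step fails independently with the same probability. The function names are hypothetical, and the error rates in the usage lines are made-up inputs, not reported results.

```python
import math


def error_free_probability(step_error: float, steps: int) -> float:
    """Probability that a chain of `steps` steps contains no error at all.

    Assumes independent steps with identical error rate: (1 - e) ** n,
    computed via log1p/exp so tiny error rates keep their precision.
    """
    if step_error >= 1.0:
        return 0.0 if steps > 0 else 1.0
    return math.exp(steps * math.log1p(-step_error))


def majority_wrong_probability(step_error: float, voters: int) -> float:
    """Probability that more than half of `voters` independent agents err.

    This is a pessimistic bound on a failed majority vote, since it treats
    all erring agents as if they agreed on the same wrong answer.
    """
    needed = voters // 2 + 1
    return sum(
        math.comb(voters, i) * step_error**i * (1.0 - step_error) ** (voters - i)
        for i in range(needed, voters + 1)
    )


if __name__ == "__main__":
    steps = 1_000_000
    single_agent_error = 0.001  # made-up per-step error rate for illustration
    voted_error = majority_wrong_probability(single_agent_error, voters=5)
    print("no voting:", error_free_probability(single_agent_error, steps))
    print("5 voters: ", error_free_probability(voted_error, steps))
```

Under these assumptions, even a small per-step error rate makes an error-free million-step run very unlikely without correction, while voting at each step shrinks the effective error rate enough to change that picture.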
