Forge: A Python framework for reliable self-hosted LLM tool-calling and multi-step agent workflows
By zambelli
Summary
Forge is a Python framework that makes self-hosted LLM tool-calling and multi-step agentic workflows more reliable. It acts as a reliability layer for small local models (for example, 8B-parameter models), improving their results through guardrails (rescue parsing, retry nudges, step enforcement) and context management (VRAM-aware budgets, tiered compaction). The current top self-hosted configuration, Ministral-3 8B Instruct Q8 on llama-server, scores 86.5% across Forge's 26-scenario evaluation suite and 76% on the hardest tier. The framework offers three usage modes: WorkflowRunner for structured agent loops, a CLI for interactive sessions, and a server mode for API-based deployments.
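To make the guardrail terms concrete, here is a minimal sketch of what "rescue parsing" and "retry nudges" could look like in general. This is not Forge's API: the names `rescue_parse` and `call_with_nudges`, the `generate` callable, and the `{"name": ..., "arguments": ...}` tool-call shape are all assumptions made for illustration.

```python
import json
import re


def rescue_parse(raw: str):
    """Try to recover a tool-call dict from messy model output.

    Hypothetical helper; returns None when nothing usable is found.
    """
    # Happy path: the whole reply is already a valid JSON tool call.
    try:
        obj = json.loads(raw)
        if isinstance(obj, dict) and "name" in obj:
            return obj
    except json.JSONDecodeError:
        pass

    # Rescue path: the JSON may be wrapped in prose or a code fence,
    # so try decoding from every "{" until one yields a tool call.
    decoder = json.JSONDecoder()
    for match in re.finditer(r"\{", raw):
        try:
            obj, _ = decoder.raw_decode(raw, match.start())
        except json.JSONDecodeError:
            continue
        if isinstance(obj, dict) and "name" in obj:
            return obj
    return None


def call_with_nudges(generate, prompt: str, max_retries: int = 2):
    """Ask the model for a tool call, nudging it when the reply is unusable.

    `generate` is an assumed callable: list of message strings -> reply string.
    """
    messages = [prompt]
    for _ in range(max_retries + 1):
        call = rescue_parse(generate(messages))
        if call is not None:
            return call
        # Retry nudge: tell the model what was wrong and what to send instead.
        messages.append(
            "Your last reply was not a valid tool call. Reply with only a "
            "JSON object containing 'name' and 'arguments'."
        )
    raise ValueError("no valid tool call after retries")
```

The idea in this sketch is that a small model often produces a nearly correct call, so recovering it or asking once more is cheaper than failing the whole workflow step.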
Key quotes
Forge lifts an 8B local model to the top of its class on multi-step agentic workflows through guardrails (rescue parsing, retry nudges, step enforcement) and context management (VRAM-aware budgets, tiered compaction).
The current top self-hosted config (Ministral-3 8B Instruct Q8 on llama-server) scores 86.5% across forge's 26-scenario eval suite — and 76% on the hardest tier.
WorkflowRunner — Define tools, pick a backend, run structured agent loops. Forge manages the full lifecycle: system prompts, tool execution, error recovery, and context budgets.
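As an illustration of the context-management idea mentioned above, the following is a rough sketch of tiered compaction against a fixed token budget. It is a generic pattern, not Forge's implementation: the `compact` function, the message dict shape, the 4-characters-per-token estimate, and the two tiers chosen here are all assumptions. How Forge derives its budget from available VRAM is not described in the source, so the budget is simply passed in.

```python
def estimate_tokens(text: str) -> int:
    # Crude assumption: roughly 4 characters per token.
    # A real tokenizer would give more accurate counts.
    return max(1, len(text) // 4)


def total_tokens(messages) -> int:
    return sum(estimate_tokens(m["content"]) for m in messages)


def compact(messages, budget: int, keep_recent: int = 2,
            stub: str = "[tool output truncated]"):
    """Shrink a message history until it fits `budget` (hypothetical helper).

    Tier 1 replaces old tool outputs with a short stub.
    Tier 2 drops the oldest non-system messages.
    The last `keep_recent` messages are never altered.
    """
    msgs = [dict(m) for m in messages]  # copy so the caller's list is untouched
    if total_tokens(msgs) <= budget:
        return msgs

    # Tier 1: stub out old tool outputs, oldest first.
    cutoff = max(0, len(msgs) - keep_recent)
    for m in msgs[:cutoff]:
        if m["role"] == "tool":
            m["content"] = stub
            if total_tokens(msgs) <= budget:
                return msgs

    # Tier 2: drop the oldest non-system message until the history fits.
    while total_tokens(msgs) > budget:
        older = msgs[:max(0, len(msgs) - keep_recent)]
        idx = next((i for i, m in enumerate(older) if m["role"] != "system"), None)
        if idx is None:
            break  # only system and recent messages remain
        del msgs[idx]
    return msgs
```

Applying the cheaper tier first preserves the shape of the conversation for as long as possible; messages are only dropped when stubbing tool outputs is not enough.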
