AI Agent Uses SQL to Analyze Billions of CI Log Lines for Debugging
By
shad42
Toasted golden, schmeared with insight. Top of the rack.
Summary
The article describes how an AI agent was given SQL access to massive CI (Continuous Integration) log data, enabling it to autonomously investigate and trace flaky tests by querying terabytes of historical log data. The system processes 1.5 billion CI log lines and 700K jobs weekly, stored in ClickHouse with 35:1 compression, allowing millisecond query times. The AI agent can write its own SQL queries to scan hundreds of millions of log lines across multiple queries, following trails from job metadata to raw log output to identify root causes of issues in seconds.
Key quotes
· 4 pulledLast week, our agent traced a flaky test to a dependency bump three weeks prior. It did this by writing its own SQL queries, scanning hundreds of millions of log lines across a dozen queries, and following a trail from job metadata to raw log output.
Every week, about 1.5 billion CI log lines and 700K jobs flow through our system. All of it lands in ClickHouse, compressed at 35:1. All of it is queryable in milliseconds.
The whole investigation took seconds.
To do this, the agent needs context: not one log file, but every build, every test, every log line, across months of history.
You might also wanna read
AgenticLens: Visual Debugging Tool for AI Agent Workflows
AgenticLens is a tool that helps developers debug AI agent workflows by converting messy logs into a visual workspace with flow views, timel
TraceRoot.AI launches open-source AI observability platform for faster debugging
TraceRoot.AI is an open-source, AI-native observability and debugging platform that helps developers fix production bugs faster. It consolid
Comie.dev: A production context layer that connects AI coding agents to live logs, databases, and error tracking
Comie.dev is a new tool that acts as a production context layer for AI coding agents like Claude Code, Cursor, and Codex. It connects AI too
Tracea: Self-hosted monitoring and debugging tool for AI agents with automatic RCA and team memory
Tracea is a self-hosted monitoring and debugging tool for AI agents, similar to Datadog but purpose-built for agent workflows. It captures a
Atla: AI Agent Debugging Tool Launches to Automatically Detect and Cluster Failures
Atla is a new evaluation tool for AI agents that automatically detects and clusters step-level failures into recurring patterns, helping dev
Claude Code Launches Multi-Agent AI Code Review System for Bug Detection
Anthropic's Claude Code now offers a multi-agent AI code review system that analyzes pull requests to catch bugs, security issues, and logic
