
AI Agent Uses SQL to Analyze Billions of CI Log Lines for Debugging

By shad42 · 3mo ago · 11 min read

Summary

The article describes how an AI agent was given SQL access to continuous integration (CI) log data so that it can investigate flaky tests on its own by querying terabytes of historical logs. The system ingests about 1.5 billion CI log lines and 700K jobs per week, stores them in ClickHouse at 35:1 compression, and serves queries in milliseconds. The agent writes its own SQL, scanning hundreds of millions of log lines across about a dozen queries and following a trail from job metadata to raw log output. In the example given, it traced a flaky test to a dependency bump made three weeks earlier, and the whole investigation took seconds.
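The "job metadata to raw log output" trail can be illustrated with a minimal sketch. Everything specific here is an assumption: the article does not publish its schema or queries, so the `jobs` and `log_lines` tables, the sample rows, and the helper names are invented for illustration, and Python's built-in `sqlite3` stands in for ClickHouse so the example runs anywhere.

```python
import sqlite3

# Hypothetical schema: the article does not describe its real table layout.
SCHEMA = """
CREATE TABLE jobs (job_id INTEGER PRIMARY KEY, started_at TEXT, status TEXT);
CREATE TABLE log_lines (job_id INTEGER, line_no INTEGER, text TEXT);
"""


def build_demo_db():
    """Create an in-memory database with a few invented jobs and log lines."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO jobs VALUES (?, ?, ?)", [
        (1, "2024-01-01", "passed"),
        (2, "2024-01-08", "failed"),
        (3, "2024-01-15", "failed"),
    ])
    conn.executemany("INSERT INTO log_lines VALUES (?, ?, ?)", [
        (1, 1, "installing libfoo 1.2.0"),
        (1, 2, "test_checkout PASSED"),
        (2, 1, "installing libfoo 1.3.0"),
        (2, 2, "test_checkout FAILED"),
        (3, 1, "installing libfoo 1.3.0"),
        (3, 2, "test_checkout FAILED"),
    ])
    return conn


def earliest_failing_job(conn, test_name):
    """Query 1: use job metadata to find the first job where the test failed."""
    row = conn.execute(
        """
        SELECT j.job_id
        FROM jobs AS j
        JOIN log_lines AS l ON l.job_id = j.job_id
        WHERE j.status = 'failed' AND l.text LIKE ?
        ORDER BY j.started_at
        LIMIT 1
        """,
        (f"%{test_name} FAILED%",),
    ).fetchone()
    return row[0] if row else None


def install_lines(conn, job_id):
    """Query 2: drop down to that job's raw log lines to see what was installed."""
    rows = conn.execute(
        "SELECT text FROM log_lines "
        "WHERE job_id = ? AND text LIKE 'installing %' ORDER BY line_no",
        (job_id,),
    ).fetchall()
    return [r[0] for r in rows]


conn = build_demo_db()
first_bad = earliest_failing_job(conn, "test_checkout")
print(first_bad, install_lines(conn, first_bad))
```

In this toy data the first failing job is the one that installed `libfoo 1.3.0` instead of `1.2.0`, which mirrors the dependency-bump finding in miniature. An agent in the setting the article describes would generate queries like these itself rather than call fixed helpers.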

Key quotes

Last week, our agent traced a flaky test to a dependency bump three weeks prior. It did this by writing its own SQL queries, scanning hundreds of millions of log lines across a dozen queries, and following a trail from job metadata to raw log output.
Every week, about 1.5 billion CI log lines and 700K jobs flow through our system. All of it lands in ClickHouse, compressed at 35:1. All of it is queryable in milliseconds.
The whole investigation took seconds.
To do this, the agent needs context: not one log file, but every build, every test, every log line, across months of history.
Snippet from the RSS feed
We gave our AI agent a SQL interface to billions of CI log lines in ClickHouse. How we ingest, store, and query 1.5 billion log lines a week.
