Command-Line Tools Outperform Hadoop by 235x for Moderate-Scale Data Processing

Adam Drake is an advisor to scale-up tech companies. He writes about ML/AI/data, leadership, and building tech teams.

tosh5mo ago9 min readenInsight

You might also wanna read

Big data storage tools like BigQuery, Hadoop, and Cassandra are used to manage large volumes of structured and unstructured data generated b

Your humble laptop can only do so much. Here's your beginner's guide to computing clusters!

Schema migrations on a streaming ClickHouse table? At 100s of MB/s? Here's how we do it across clusters without losing a single bit.

Somehow we got stuck on the idea that big data systems should be slow. We're making it fast.

How we replaced Python pipelines with dlt, dbt, and Trino — and cut delivery time from weeks to one day.

We often get questions like this, and it's hard to properly convey in writing the potentially drastic latency differences between a typical

No comments yet. Be the first.