All Topics

Technology

Business

Entertainment

News

Programming

Science

Design

Environment

Finance

Crypto

Politics

Sports

Education

Gaming

Art

Music

Health

Security

Books

Food

Travel

Personal

How Mindbox replaced PySpark with YAML-based pipelines using dlt, dbt, and Trino

How we replaced Python pipelines with dlt, dbt, and Trino — and cut delivery time from weeks to one day.

Read the full article

Kiril Kazlou1mo ago11 min readenInsight

technology programming analytics data engineering

You might also wanna read

Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python Notebook

> **Cross-posted.** This article's canonical home is [Iceberg Lakehouse]( - [A...

datalakehousehub.com·1y ago

Exploring Data Operations with PySpark, Pandas, DuckDB, Polars, and DataFusion in a Python Notebook

- [Apache Iceberg Crash Course: What is a Data Lakehouse and a Table Format?](

iceberglakehouse.com·1y ago

MCP and Data Pipelines: How AI Agents Connect to Airflow, dbt, Kafka, Snowflake, and the Modern Data Stack

A comprehensive guide to MCP integrations across the modern data stack — covering Airflow, dbt, Kafka, Snowflake, BigQuery, Databricks, Five

chatforest.com·3mo ago

Databricks Runtime 18: June 10, 2026

Databricks Runtime 18 is now generally available (GA). For lifecycle details, see Databricks Runtime support lifecycles . New features and i

Microsoft·1mo ago

End-to-End Basic Data Engineering Tutorial (Spark, Dremio, Superset)

> **Cross-posted.** This article's canonical home is [Iceberg Lakehouse](

datalakehousehub.com·2y ago

Databricks Runtime 18: June 10, 2026

Databricks Runtime 18 is now generally available (GA). For lifecycle details, see Databricks Runtime support lifecycles . New features and i

Databricks Community·1mo ago

Comments

Sign in to join the conversation.

No comments yet. Be the first.