Seeknal: A CLI Tool for Data & AI/ML Pipelines with Natural Language Queries
By Fitra Kacamarga
Summary
Seeknal is a CLI tool for data and AI/ML engineering. Users define pipelines in YAML or Python, run a safe draft→dry-run→apply workflow, materialize data to PostgreSQL and Iceberg, and query data in natural language. The tool is organized around three core verbs: Organize (data transformation), Expose (dashboards and NL queries), and Action (carrying insights through to reports, APIs, and alerts). It is designed for a world of agent-based computing.
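To make the draft→dry-run→apply idea concrete, here is a minimal Python sketch of that kind of staged workflow. It is not Seeknal's API: `Pipeline`, `dry_run`, `apply`, and the `known_steps` and `target` arguments are hypothetical names chosen for illustration, and the "target" is just a dictionary standing in for a real store.

```python
from dataclasses import dataclass


@dataclass
class Pipeline:
    """Hypothetical in-memory stand-in for a pipeline definition."""
    name: str
    steps: list
    status: str = "draft"  # every pipeline starts life as a draft


def dry_run(pipeline, known_steps):
    """Validate a draft without touching any data; return a list of problems."""
    problems = [f"unknown step: {s}" for s in pipeline.steps if s not in known_steps]
    if not pipeline.steps:
        problems.append("pipeline has no steps")
    return problems


def apply(pipeline, known_steps, target):
    """Apply only when the dry run is clean; otherwise raise and change nothing."""
    problems = dry_run(pipeline, known_steps)
    if problems:
        raise ValueError("; ".join(problems))
    target[pipeline.name] = list(pipeline.steps)  # stand-in for materialization
    pipeline.status = "applied"
    return pipeline
```

The point of the shape is that `apply` cannot skip validation: a draft with problems never reaches the target, which is what makes the workflow "safe" in this sketch.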
Key quotes
Define pipelines in YAML or Python, run a safe draft → dry-run → apply workflow
Materialize to PostgreSQL and Iceberg, and query your data in natural language
Three verbs: Organize (transform raw data, point-in-time joins, incrementals), Expose (dashboards, features, NL query), Action (insight → report → API → alert)
Built for the agent world
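The "point-in-time joins" mentioned under Organize can be illustrated with a short, generic Python sketch. This shows the general technique only, not how Seeknal implements it: the function name `point_in_time_join` and the data layout are assumptions made for the example. For each event, it attaches the latest feature value recorded at or before the event's timestamp, so no future information leaks into the result.

```python
from bisect import bisect_right


def point_in_time_join(events, feature_history):
    """Attach to each event the latest feature value known at event time.

    events: list of (entity, event_time) pairs.
    feature_history: dict mapping entity -> list of (timestamp, value)
        pairs, sorted by timestamp in ascending order.
    Returns a list of (entity, event_time, value); value is None when no
    feature was recorded at or before event_time.
    """
    joined = []
    for entity, event_time in events:
        history = feature_history.get(entity, [])
        timestamps = [ts for ts, _ in history]
        # Index of the first record strictly after event_time.
        idx = bisect_right(timestamps, event_time)
        value = history[idx - 1][1] if idx > 0 else None
        joined.append((entity, event_time, value))
    return joined
```

A feature recorded exactly at the event time is included here; whether that boundary should be inclusive is a design choice that depends on how timestamps are produced.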
