writing
notes on data engineering, open-source tools, and research
Lineage Is a Graph Problem: Tracing Revenue to Raw Data With PuppyGraph Jun 17, 2026
Testing MotherDuck for Concurrent FHIR Analytics over S3 Parquet Jun 9, 2026
Postgres to ClickHouse CDC for Trade Surveillance Jun 9, 2026
Firebolt, Trino, and Iceberg Pruning Internals Jun 7, 2026
Auditing XTable-Generated Iceberg Metadata in Firebolt Jun 6, 2026
One Row Changed in Hudi. The Vector Index Resynced Itself. Jun 2, 2026
The AI Didn't Hallucinate. The Data Did. May 22, 2026
ClickHouse vs DuckDB on Real FHIR Data: What the Benchmark Actually Shows May 15, 2026
Connecting LanceDB to PuppyGraph: Building the Proxy That Should Not Exist May 10, 2026
I Built a Snowflake Query Savings Estimator (Inspired by Greybeam) May 8, 2026
PuppyGraph: Graph Analytics Without the Overhead May 1, 2026
FHIR to Parquet: 39x Faster Queries on Healthcare Data Apr 20, 2026