writing

notes on data engineering, open-source tools, and research

Lineage Is a Graph Problem: Tracing Revenue to Raw Data With PuppyGraph
Jun 17, 2026
Testing MotherDuck for Concurrent FHIR Analytics over S3 Parquet
Jun 9, 2026
Postgres to ClickHouse CDC for Trade Surveillance
Jun 9, 2026
Firebolt, Trino, and Iceberg Pruning Internals
Jun 7, 2026
Auditing XTable-Generated Iceberg Metadata in Firebolt
Jun 6, 2026
One Row Changed in Hudi. The Vector Index Resynced Itself.
Jun 2, 2026
The AI Didn't Hallucinate. The Data Did.
May 22, 2026
ClickHouse vs DuckDB on Real FHIR Data: What the Benchmark Actually Shows
May 15, 2026
Connecting LanceDB to PuppyGraph: Building the Proxy That Should Not Exist
May 10, 2026
I Built a Snowflake Query Savings Estimator (Inspired by Greybeam)
May 8, 2026
PuppyGraph: Graph Analytics Without the Overhead
May 1, 2026
FHIR to Parquet: 39x Faster Queries on Healthcare Data
Apr 20, 2026