Blog
Notes from the agent loop.
Engineering posts, FinOps deep dives, and lessons from running autonomous remediation in production.
Databricks cost optimization is not a dashboard problem
Why Databricks FinOps teams need ownership, workflow, and verified action after cost visibility is in place.
What Databricks system tables tell you about cost, and what they still do not
System tables are the foundation for Databricks observability, but cost optimization still needs context, ownership, and action.
How to turn Databricks billing usage into verified savings
A practical workflow for moving from Databricks usage evidence to approved fixes and read-back verification.
Databricks query history is the missing link between performance and cost
How query history helps connect slow SQL workloads, warehouse spend, ownership, and optimization opportunities.
How to use AI for Databricks optimization without exposing customer data
Why metadata and telemetry are enough for Databricks FinOps and reliability agents, and how to keep AI reasoning inside enterprise boundaries.
50+ Databricks waste and reliability detectors platform teams should track
A practical map of detector categories across Databricks cost waste, Spark reliability drift, Delta table hygiene, ownership, and remediation opportunities.
From alerts to autonomous remediation for the lakehouse
A practical playbook for moving Databricks operations from dashboard alerts to governed, verified action.
An agent loop in production: lessons from the first hundred actions
Designing for verification, blast-radius caps, and human-friendly audit trails. What we got wrong, what we'd ship differently, and what we're still figuring out.
What system tables don't tell you about Databricks cost
DBU billing is one piece of the puzzle. Real cost lives in EC2, EBS, NAT gateways, and inconsistent tagging. Here's how we reconcile across three tiers.
Why we built an autonomous agent for the lakehouse
For a decade, observability told us about problems. The next decade is about closing the loop. Here's how we think about it.