What Databricks Still Hasn't Solved
It's April 2025, two months before DAIS 2025, and I've been running a list of what Databricks hasn't solved from the 2024 announcements. This isn't a complaint — the platform has gotten better every quarter. But honest critique is how you calibrate what to pay attention to at the next Summit versus what's still vaporware.
The Agent Governance Gap
At DAIS 2024, Databricks announced Mosaic AI Agent Framework as a way to build compound AI systems. The framework is real and useful. What's still missing: first-class governance for agents in Unity Catalog. You can't register an agent as a UC object, define its authorized tool access in UC, audit its invocations through UC, or track its lineage in UC. The agents exist outside the governance layer that everything else in the platform runs through.
This matters because enterprises that are serious about deploying agents in production need to answer compliance questions: what tools does this agent have access to, under what conditions can it invoke them, and what's the audit trail. Right now, answering those questions requires custom instrumentation on top of the framework, which means every team answers them differently. UC-native agent governance would standardize this. I expect it to arrive; it's not here yet.
Cross-Account Lineage
Still missing in any coherent form. If you have a dev workspace and a prod workspace in separate Databricks accounts — which is the right security architecture for most enterprises — the lineage chain breaks at the account boundary. A model trained in dev and promoted to prod doesn't have end-to-end lineage from source data to production serving endpoint. You have two separate lineage chains that you have to mentally join.
This is a real limitation for enterprises doing regulatory AI documentation. "What data trained this production model" currently requires looking at both the prod account and the dev account and assembling the answer manually.
Streaming Governance in Unity Catalog
Unity Catalog governs batch Delta tables comprehensively. Streaming sources — Kafka topics, Event Hubs, streaming Delta tables maintained by Structured Streaming — are second-class citizens in the UC permission model. You can't manage Kafka topic access through UC. You can't track Kafka-to-Delta lineage through UC without custom instrumentation. For enterprises running streaming-heavy architectures, this is a daily friction point.
My Questions for Summit 2025
Three things I'm specifically looking for: a demo of UC-native agent governance that shows an agent registered as a UC object with governed tool access. A cross-account lineage story that's more than "we're working on it." And a streaming governance capability that puts Kafka sources and real-time Delta tables in the same UC governance namespace as batch tables. If those three things are on the keynote, DAIS 2025 will have been worth attending. As always, I'm here to help.