Open Lakehouse and AI
The OSA Community is proud to host the Real Time Data Lakes and AI event in San Francisco!
As real-time databases integrate more closely with data lakes to reduce storage costs and unlock data for AI and advanced analytics, data infrastructure is evolving fast. Join us to hear from leading experts as they share practical solutions and lessons learned in building open, scalable, and high-performance data platforms.
Speakers
Robert Hodges, CEO @ Altinity
Team member @ CelerData
Éamon Ryan, Senior Principal Field Engineer @ Grafana
Team member @ PostHog
Agenda
6 pm - Networking
6:15 - 8:00 pm - Talks
8:00 - 9:00 pm - Networking
Description of the Talks
Building a Foundation for AI with ClickHouse® and Apache Iceberg Storage
Speaker: Robert Hodges, CEO @ Altinity
Abstract: AI applications need data. Lots of it. Altinity's Project Antalya is adapting open source ClickHouse® to introduce separation of compute and storage on shared Iceberg table data. The result: fast, cheap, flexible query that extends the life of real-time analytic applications and lays the foundation for handling new AI use cases on the same datasets. We cover architecture, performance results, roadmap, and how to get started yourself.
Achieving High-Performance Analytics on Apache Iceberg
Speaker: Team member @ CelerData
Abstract:Apache Iceberg enables open and flexible lakehouse architectures, yet delivering low-latency, high-concurrency analytics in production remains a significant challenge. This talk examines the key factors that constrain query performance on Iceberg and explores how modern query engines, such as StarRocks, address these challenges.
We’ll examine common bottlenecks such as metadata overhead, delete handling (position and equality deletes), and query planning costs under concurrency. The session then dives into practical optimization techniques—including scalable metadata parsing, engine-level execution optimizations, and best practices for production Iceberg workloads.
Backed by real-world enterprise use cases, this talk provides actionable insights for engineers looking to run fast, reliable analytics on Apache Iceberg at scale.
Visualizing Your Data Lake with Grafana
Speaker: Éamon Ryan, Senior Principal Field Engineer @ Grafana @ Grafana
Abstract: In this brief talk, we’ll walk through how to get started with Grafana’s open source platform to explore and understand your data lake. We’ll cover how to connect to your data—no matter where it lives—then craft queries that turn raw information into clear, compelling visualizations, and finally set up alerts and annotations so you’re always in the know when something important changes in your data lake.
