Real Time Data Lakes ft ClickHouse®, DuckDB, StarRocks, and S3

Hosted by Open Source Analytics Community & 3 others
Registration
Welcome! To join the event, please register below.
About Event

​The OSA Community is proud to present an evening of insights and networking in New York City, featuring experts from ClickHouse®, StarRocks, Snowflake, and AWS.

Real-time databases are integrating with data lakes to reduce storage costs and share data with AI and data science. Please join us to hear from a range of experts as they share current problems and solutions while navigating the transition from closed storage models to open table formats like Apache Iceberg.

Join us in New York City for an evening with experts from ClickHouse, StarRocks, Snowflake, and AWS. Learn how to build high-performance real-time data lake systems. Networking to follow presentations. Food and drink provided!

P.S. The OSA Community is also hosting the online OSA Conference on November 4–5. Register here.

Presenters

Description of the Talks

Adapting ClickHouse to use Apache Iceberg Storage - Robert Hodges, CEO @ Altinity.

  • ​Covers Altinity's Project Antalya, which is adapting open source ClickHouse to introduce separation of compute and storage using Iceberg tables as. Architecture, performance results, and roadmap are included. 

Achieving Data Warehouse Performance on Apache Iceberg - Ron Kapoor, Developer Advocate @ CelerData

  • ​This talk dives into technical optimizations that deliver low-latency, high-concurrency queries on Apache Iceberg without sacrificing openness. Together, we'll examine what kills performance when querying Iceberg, highlight best practices that make queries faster, and evaluate query engine optimizations for Iceberg—including handling position and equality delete tables, distributed metadata parsing, and more. You'll hear real-world stories from leading enterprises who have used these lessons to optimize Apache Iceberg performance at scale and walk away with actionable techniques for making your Iceberg lakehouse faster than ever.

What the Duck? - Elizabeth Christensen, Developer Advocate @ Snowflake

  • DuckDB is the new database technology that seems to pop up in every database conversation this year. It is a lightweight query engine designed for fast analytical queries without the overhead of a traditional database server. This talk will be a technical introduction to DuckDB - how it works, how to set it up, and several demos of analytics using DuckDB and freely available data. DuckDB has a novel approach to data, utilizing object storage and data formats like Iceberg, Parquet, JSON, and GeoParquet. We’ll talk about these file formats and some of the hybrid approaches that integrate DuckDB with other platforms. I’ll include several hands-on sample code for working with DuckDB from basic queries to more complicated SQL and geospatial use cases. If you’re curious about DuckDB or are ready to integrate it into your day to day data projects, let’s get you started.

Description of the Presenters

Robert Hodges - Robert is the CEO of Altinity, an enterprise provider for ClickHouse data warehouse. He's also a database geek with experience on at least 20 DBMS types. Robert caught the Kubernetes bug at VMware in 2018.

Connect with Robert on LinkedIn.

Ron Kapoor - Ron is a Developer Advocate at CelerData, where he helps bridge the gap between high-performance analytics technology and the data engineering community. Before joining CelerData, Ron worked at Jefferies, where he engineered real-time streaming systems and electronic trading infrastructure for fixed income products. His work focused on ultra-low-latency performance, reliability, and data integrity in capital markets environments. He’s especially interested in modern data architecture. Particularly streaming analytics, lakehouse ecosystems, and how simplicity and performance intersect in developer-centric systems.

Connect with Ron on LinkedIn.

Elizabeth Christensen - Elizabeth is a Postgres contributor and technical writer at Snowflake from Lawrence, Kansas. She is on the board of the US PostgreSQL Association, hosts Postgres Meetup for All, and the annual PostGIS Day. Elizabeth is passionate about open source technology and diversity in tech through education. 

Connect with Elizabeth on LinkedIn.

Location
307 W 38th St #1505
New York, NY 10018, USA