Optimizing Your Iceberg Tables for Real Time Analytics (Without Melting Your Budget)

Real Time Data + AI

Zoom

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

About

Optimizing Apache Iceberg for real time analytics is not just about getting the table format right. It is about keeping tables lean, well organized, and ready to serve fresh, low latency queries while data is continuously ingested. Without the right approach, small file storms from streaming and CDC, growing delete files, and unnecessary full table rewrites can slow dashboards and operational analytics while driving costs up.

In this webinar, we will share a practical blueprint for running fast and cost efficient Iceberg tables in production for real-time analytics across engines. We will cover usage aware optimization strategies, including targeted compaction strategies, right sized data and manifest files, effective management of equality and position deletes, and maintenance patterns that work with streaming and CDC pipelines instead of conflicting with them. We will also discuss freshness-oriented design choices such as partitioning and ordering for common filters, dynamic file sizing that adapts to ingestion rates, prompt rewrite of deletes in high traffic partitions, and metadata compaction to keep planning predictable. Finally, we will show how lightweight, specialized engines for table maintenance and optimization can replace oversized infrastructure so you achieve consistent low latency, predictable SLAs, and lower total cost without adding complexity.

Speakers
- Yingjun Wu, Founder and CEO, RisingWave
- Yuval Yogev, Co-founder and CTO, Ryft

Presented by

Real Time Data + AI

In-Person Events for (Real Time) Data and AI Geeks

Hosted By

127 Went

IA