Cover Image for Open Data Circle – Lakehouse Meetup #2
Cover Image for Open Data Circle – Lakehouse Meetup #2
Avatar for Open Data Circle
Presented by
Open Data Circle
Registration
Welcome! Please choose your desired ticket type:
About Event

Overview

Open Data Circle – Lakehouse Meetup #2, co-hosted with CelerData, explores how modern data platforms are evolving toward an AI-native future — as AI-driven workloads reshape expectations for storage, indexing, and real-time analytics across table formats, compute engines, and retrieval-oriented applications.

This meetup examines these shifts from three complementary perspectives:

  • Architectural directions for making lakehouse systems AI-native

  • StarRocks as a next-generation high-performance engine for the AI era

  • Practical lessons from building RAG systems, including an AI Legal Reviewer demo

Expect concise, insight-driven sessions and opportunities to connect with peers shaping the next generation of data platforms.

Agenda

All sessions will be conducted in English; no live translation will be provided.

18:30 – 19:00: Arrival & Check-In (Late arrivals cannot enter the venue after 19:15)

19:00 – 19:05: Opening Remarks

19:05 – 19:25: Talk #1 – Exploring AI-native Directions for Lakehouse (Zhiyan Xiao, Open Data Circle)

19:25 – 19:45: Talk #2 – StarRocks: A Fast Data Engine for Rapid AI Development (Cheng Sun, CelerData)

19:45 – 20:05: Talk #3 – Exploring the AI Legal Reviewer using RAG (Alec Lee, Open Data Circle)

20:05 – 20:20: Open Discussion (Q&A, sharing, etc.)

20:20 – 21:30: Networking Session

Agenda is subject to minor adjustments before showtime; check this page or your Luma updates for the latest schedule.

Session Materials

Slides and resources from the talks will be published shortly after the event — stay tuned for the update.

Speakers

Zhiyan Xiao

Organizer, Open Data Circle

Exploring AI-native Directions for Lakehouse

AI-native workloads are redefining expectations for lakehouse systems — requiring table formats and storage layers to support richer semantics, adaptive indexing, and tighter integration with real-time and vector-driven data.
This talk examines emerging approaches for making lakehouses more AI-native, with a focus on architectural shifts happening at the table-format and data-layout layers. By recognizing patterns emerging across modern open formats and storage designs, the session highlights how these developments are opening new possibilities for more adaptive, intelligent, and future-ready data platforms.

Cheng Sun

Customer Success, CelerData

StarRocks: A Fast Data Engine for Rapid AI Development

With the rapid evolution of Large Language Models (LLMs) and AI Agents, data governance has become a core competitive differentiator for enterprises.
To meet the demanding requirements of modern AI applications, this session explores how CelerData has optimized the essential pillars of a Data Engine: real-time performance, compatibility, and scalability. We will demonstrate how StarRocks delivers a lightning-fast, unified lakehouse architecture, providing the robust data foundation necessary for the AI era.

Alec Lee

AX Full-stack Explorer / Contributor, Open Data Circle

Exploring the AI Legal Reviewer using RAG

Alec is a DX & AX (AI Transformation) Specialist who converts strategic vision into measurable execution, experienced in bridging technology and business value, managing complex projects across diverse cultures, organizations and systems.
This talk will explore an AI Legal Reviewer leveraging RAG and open-source systems to ensure contract compliance, focusing on accuracy and safety.

Organizers and Partners

Open Data Circle (ODC)

Open Data Circle (ODC) is an independent community based in Tokyo that connects data engineers, platform builders, and open-source enthusiasts who are passionate about building the future of data systems together. We explore modern data architectures — from streaming and lakehouse to AI-native systems — through hands-on meetups, open discussions, and collaborative projects. ODC aims to create a friendly and high-quality space where professionals across companies can exchange ideas, learn from each other, and drive innovation in the data ecosystem.

CelerData

CelerData, Inc. is a visionary real-time analytics company established to scale the capabilities of StarRocks for the enterprise. By delivering the commercial edition of StarRocks and a flexible Bring-Your-Own-Cloud (BYOC) service, CelerData bridges the gap between open-source innovation and complex enterprise needs. The company empowers data teams to transcend infrastructure management, allowing them to focus on delivering customer value through blazing-fast Lakehouse architectures and AI-native workflows. Trusted by global innovators like Airbnb, Pinterest, and Demandbase, CelerData provides the critical support, security, and performance required to turn petabyte-scale data into instant intelligence.

Location

Onsite: LODGE (LY Corporation Head Office, Kioicho, Tokyo). Detailed arrival instructions will be shared later.

Online: Zoom dial-in details will be shared closer to the event via Luma.

Location
LY Corporation Head Office
Japan, 〒102-8282 Tokyo, Chiyoda City, Kioichō, 1−3 東京ガーデンテラス 紀尾井タワ
Avatar for Open Data Circle
Presented by
Open Data Circle