![Cover Image for Lakehouse, Lagers & Legends [Kolkata]: Building ML-Ready Data Pipelines with Databricks](https://images.lumacdn.com/cdn-cgi/image/format=auto,fit=cover,dpr=2,background=white,quality=75,width=400,height=400/uploads/ng/e6a242ac-5ab6-475d-a91b-87e763c8aa3a.jpg)
![Cover Image for Lakehouse, Lagers & Legends [Kolkata]: Building ML-Ready Data Pipelines with Databricks](https://images.lumacdn.com/cdn-cgi/image/format=auto,fit=cover,dpr=2,background=white,quality=75,width=400,height=400/uploads/ng/e6a242ac-5ab6-475d-a91b-87e763c8aa3a.jpg)
Lakehouse, Lagers & Legends [Kolkata]: Building ML-Ready Data Pipelines with Databricks
About This Session
We’re bringing a special edition of Lakehouse, Lagers, & Legends [Kolkata] — focused on one of the most exciting shifts happening in data platforms today: moving beyond static pipelines into intelligent, governed orchestration.
Session Overview
This session will explore how raw data is ingested, transformed, and refined into trusted, business-ready datasets that support analytics, machine learning, and decision-making.
We will walk through the end-to-end data journey, starting with data ingestion using Auto Loader, followed by structured transformation through the Medallion Architecture. The session will cover how data flows from the Bronze layer, where raw data is captured, to the Silver layer, where data is cleaned, standardized, and validated, and finally to the Gold layer, where curated datasets are prepared for reporting, analytics, and machine learning use cases.
We will also discuss how data scientists leverage these refined datasets to uncover insights, build predictive models, and translate analytical findings into actionable strategies that improve operations, enhance customer experiences, and drive better business outcomes.
What We’ll Cover
Introduction to building machine learning models
Introduction to AI agents and their role in modern data workflows
Setting up a data pipeline using the Medallion Architecture
Ingesting data efficiently using Auto Loader
Transforming raw data across Bronze, Silver, and Gold layers
Applying governance programmatically using Unity Catalog
Automating data quality checks and validation processes
Exploring performance optimization strategies for scalable data pipelines
Translating refined data into insights, predictions, and business actions
Q&A and open discussion on real-world scenarios, challenges, and best practices
Networking with data engineers, data scientists, platform teams, and AI professionals
Join us as v4c.ai × Databricks set the stage for many more community-first gatherings ahead. Secure your spot now and be part of this exciting event!
Data Legends on the Speaker Panel are:
About Lakehouse, Lagers & Legends
What started as a few meetups for data engineers has turned into a worldwide community of people who live and breathe data. Each event blends two things our crowd cares about most: sharp, practical sessions on the latest in data engineering and a relaxed setting to trade ideas over a cold drink.
Every stop is a little different- new city, new voices, fresh conversations, but the core stays the same: bring smart people together, share real-world lessons, and spark collaborations that carry on long after the night ends.
Join our LinkedIn group of data builders across the globe!