

Istanbul vLLM & llm-d Inference Meetup
Türkiye vLLM & llm-d Inference Meetup
Hosted by Red Hat AI, NVIDIA, and BeyondGuard, this event takes place on 17 June 2026 in Istanbul, Türkiye.
Join us for a deep dive into the engine room of vLLM and llm-d AI inferencing, where we will focus on the architecture, optimizations, and raw engineering required to run inference at scale.
Whether you’re looking to squeeze every last token out of your GPU cluster or you're curious about the latest commits to the vLLM and llm-d ecosystems, this is the room you want to be in.
What to Expect
Deep Technical Sessions: Hear directly from the maintainers and core committers of vLLM and llm-d
Scale in Production: Learn from industry leaders about deploying LLMs in production
Live Demos: See live demos focused on real-world workflows
Networking: Stick around for food and drinks. It’s a great chance to chat with the speakers and exchange ideas with fellow developers and engineers.
Who Should Attend
vLLM and llm-d users and contributors
ML and infra engineers working on inference and serving
Platform teams running GenAI in production
Anyone curious about efficient inference across local, cloud, and Kubernetes
Agenda (Subject to More Awesomeness)
13:30 – 14:00 — Doors Open, Check-In
14:00 – 14:10 — Welcome and Opening Remarks
Erkan Ercan, Principal Solution Architect, Cloud & AI Platforms, Red Hat Türkiye
14:10 – 14:40 — Intro to vLLM and Project Update
Michael Goin, vLLM Core Maintainer and Sr. Principal Engineer, Red Hat AI
14:40 – 15:10 — Efficient vLLM Inference with Model Optimization
Mireille Fares, GenAI Solution Architect, NVIDIA
15:10 – 15:40 — Scalable, Distributed Inference with llm-d
Edoardo Vacchi, Principal ML Engineer, Red Hat AI
15:40 – 15:55 — Coffee & Tea Break
15:55 – 16:15 — Intro to Speculative Decoding for Fast Inference
Michael Goin, vLLM Core Maintainer and Sr. Principal Engineer, Red Hat AI
16:15 – 16:45 — Securing vLLM in Production: Prompt Injection Defense, Data Protection, and Runtime Policy Enforcement
Tufan Küpeli, CTO, BeyondGuard
16:45 – 17:00 — Live AI Demos
17:00 – 17:30 — Discussion and Q&A
17:30 – 19:00 — Networking & Drinks
Important information
Registration closes 24 hours before the event. We cannot admit unregistered attendees.
Please bring a photo ID to verify your registration on arrival.
See you in Istanbul
If you are building, deploying, or scaling inference, this is the room to be in.
See you soon!