Cover Image for Istanbul vLLM & llm-d Inference Meetup
Avatar for vLLM Meetups and Events
Join the vLLM community to discuss optimizing LLM inference!
Registration
Approval Required
Your registration is subject to host approval.
Welcome! To join the event, please register below.
About Event

Türkiye vLLM & llm-d Inference Meetup

Hosted by Red Hat AI, NVIDIA, and BeyondGuard, this event takes place on 17 June 2026 in Istanbul, Türkiye.

Join us for a deep dive into the engine room of AI inference with vLLM and llm-d, focusing on the architecture, optimizations, and raw engineering required to run inference at scale.

Whether you’re looking to squeeze every last token out of your GPU cluster or you’re curious about the latest commits to the vLLM and llm-d ecosystems, this is the room you want to be in.

What to Expect

  • Deep Technical Sessions: Hear directly from the maintainers and core committers of vLLM and llm-d

  • Scale in Production: Learn from industry leaders about deploying LLMs in production

  • Live Demos: See live demos focused on real-world workflows

  • Networking: Stick around for food and drinks. It’s a great chance to chat with the speakers and exchange ideas with fellow developers and engineers.

Who Should Attend

  • vLLM and llm-d users and contributors

  • ML and infra engineers working on inference and serving

  • Platform teams running GenAI in production

  • Anyone curious about efficient inference across local, cloud, and Kubernetes environments

Agenda (Subject to More Awesomeness)

13:30 – 14:00 — Doors Open, Check-In

14:00 – 14:10 — Welcome and Opening Remarks

Erkan Ercan, Principal Solution Architect, Cloud & AI Platforms, Red Hat Türkiye

14:10 – 14:40 — Intro to vLLM and Project Update

Michael Goin, vLLM Core Maintainer and Sr. Principal Engineer, Red Hat AI

14:40 – 15:10 — Efficient vLLM Inference with Model Optimization

Mireille Fares, GenAI Solution Architect, NVIDIA

15:10 – 15:40 — Scalable, Distributed Inference with llm-d

Edoardo Vacchi, Principal ML Engineer, Red Hat AI

15:40 – 15:55 — Coffee & Tea Break

15:55 – 16:15 — Intro to Speculative Decoding for Fast Inference

Michael Goin, vLLM Core Maintainer and Sr. Principal Engineer, Red Hat AI

16:15 – 16:45 — Securing vLLM in Production: Prompt Injection Defense, Data Protection, and Runtime Policy Enforcement

Tufan Küpeli, CTO, BeyondGuard

16:45 – 17:00 — Live AI Demos

17:00 – 17:30 — Discussion and Q&A

17:30 – 19:00 — Networking & Drinks

Important information

Registration closes 24 hours before the event. We cannot admit unregistered attendees.

Please bring a photo ID to verify your registration on arrival.

See you in Istanbul

If you are building, deploying, or scaling inference, this is the room to be in.

See you soon!

Location
İTÜ Taşkışla Kampüsü Mimarlık Fakültesi
Harbiye, İTÜ Taşkışla Kampüsü Mimarlık Fakültesi, 34367 Şişli/İstanbul, Türkiye
Conference room 109