Cover Image for Paris inference & vLLM meetup #2
Cover Image for Paris inference & vLLM meetup #2
Avatar for EXXA
Presented by
EXXA

Paris inference & vLLM meetup #2

Register to See Address
Paris, Île-de-France
Registration
Event Full
If you’d like, you can join the waitlist.
Please click on the button below to join the waitlist. You will be notified if additional spots become available.
About Event

Join us for the second inference & vLLM technical meetup in Paris, bringing together AI practitioners, infrastructure and inference experts, as well as companies using vLLM in production.

Whether you're experimenting with vLLM or running large-scale inference workloads, this event is for you. Expect hands-on insights, real-world feedback, and open discussions with others working on optimizing inference at scale.

📍 Location: Paris, France
🌎 Language: English
🕖 Time: 6:30PM – 10:00PM
💬 Format: In-person

Agenda:

  • 6:30 – 7:00 PM: Welcome & Check-in

  • 7:00 – 8:30 PM: Technical Talks

    • Exxa - Etienne Balit (CTO): Scaling LLM inference, from multi-gpus to multi-nodes deployments

    • Apple - Olivier Dehaene (ML inference): Limitations of current inference stacks and future directions

    • .txt - Rémi Louf (CEO): Optimizing structured generation inference

  • 8:30 – 10 PM: Open networking & drinks + pizzas

We’ll discuss performance optimizations, scaling strategies, hardware compatibility, and more.

🎙️Do you want to become a speaker?
We're always looking for new speakers to share their technical experience with inference & vLLM. If you're interested, please fill this form 👉 Link

🎯 Who should come?
ML engineers, infra & DevOps teams, AI founders, and anyone working on inference, using or evaluating vLLM in their stack.

🎟️ Free registration – spots are limited

Location
Please register to see the exact location of this event.
Paris, Île-de-France
Avatar for EXXA
Presented by
EXXA