Paris inference & vLLM meetup #2

EXXA

Register to See Address

Paris, France

Sold Out

This event is sold out and no longer taking registrations.

About Event

Join us for the second inference & vLLM technical meetup in Paris, bringing together AI practitioners, infrastructure and inference experts, as well as companies using vLLM in production.

Whether you're experimenting with vLLM or running large-scale inference workloads, this event is for you. Expect hands-on insights, real-world feedback, and open discussions with others working on optimizing inference at scale.

📍 Location: Paris, France
🌎 Language: English
🕖 Time: 6:30PM – 10:00PM
💬 Format: In-person

Agenda:

6:30 – 7:00 PM: Welcome & Check-in
7:00 – 8:30 PM: Technical Talks
- Exxa - Etienne Balit (CTO): Scaling LLM inference, from multi-gpus to multi-nodes deployments
- .txt - Rémi Louf (CEO): Optimizing structured generation inference
- Hugging Face - Luc Georges (ML & Software Engineer): Transformers serve
8:30 – 10 PM: Open networking & drinks + pizzas

We’ll discuss performance optimizations, scaling strategies, hardware compatibility, and more.

🎙️Do you want to become a speaker?
We're always looking for new speakers to share their technical experience with inference & vLLM. If you're interested, please fill this form 👉 Link

🎯 Who should come?
ML engineers, infra & DevOps teams, AI founders, and anyone working on inference, using or evaluating vLLM in their stack.

🎟️ Free registration – spots are limited

Location

Please register to see the exact location of this event.

Paris, France

Presented by

EXXA

Hosted By

AI