Presented by

SGLang is an open-source, high-performance inference framework for LLMs, built and maintained by the LMSYS Org. Join our events to learn more!

Hosted By

SGLang Office Hours - Scaling LLM Serving with Ray and SGLang

SGLang Meetups and Events

YouTube

Past Event

About Event

📣 Scaling LLM Serving with Ray and SGLang

SGLang Office Hours is back!

We're co-hosting with Anyscale to dig into how Ray powers large-scale LLM serving with SGLang.

Xinyu Zhang (@xinyzng), MTS at Anyscale, opens with a deep dive into the Ray executor backend inside SGLang: why Ray is needed, how it improves RL workload placement, and what you get from Ray cluster integration.

Jeffrey Wang (@jeffreyycwang), SWE at Anyscale, covers how Ray Serve fits into large-scale SGLang deployments, the roadmap, and where the community can contribute. We'll close with a live demo and walkthrough of features open for contribution.

Whether you're serving at scale or just getting started with the Ray integration, come with your questions.
LinkedIn live stream 👉 SGLang Office Hour
YouTube live stream 👉 SGLang Office Hour

Join SGLang Slack 👉 http://slack.sglang.ai/
Follow us on X 👉 https://x.com/lmsysorg

If this helps you, please consider giving us a star — it truly motivates the team. ⭐ https://github.com/sgl-project/sglang
