SGLang is an open-source, high-performance inference framework for LLMs, built and maintained by LMSYS Org. Join our events to learn more!

SGLang Office Hours - Scaling LLM Serving with Ray and SGLang

YouTube
About Event

📣 Scaling LLM Serving with Ray and SGLang

​SGLang Office Hours is back!

​We're co-hosting with Anyscale to dig into how Ray powers large-scale LLM serving with SGLang.

​Xinyu Zhang (@xinyzng), MTS at Anyscale, opens with a deep dive into the Ray executor backend inside SGLang: why Ray is needed, how it improves RL workload placement, and what you get from Ray cluster integration.

​Jeffrey Wang (@jeffreyycwang), SWE at Anyscale, covers how Ray Serve fits into large-scale SGLang deployments, the roadmap, and where the community can contribute. We'll close with a live demo and walkthrough of features open for contribution.

​Whether you're serving at scale or just getting started with the Ray integration, come with your questions.
LinkedIn livestream 👉 SGLang Office Hour
YouTube livestream 👉 SGLang Office Hour


Join SGLang Slack 👉 http://slack.sglang.ai/
Follow us on X 👉 https://x.com/lmsysorg

If this helps you, please consider giving us a star. It truly motivates the team. ⭐ https://github.com/sgl-project/sglang
