Cover Image for Think Swift, Build More: Empowering Agents with RL Toolkits and Infrastructure
Cover Image for Think Swift, Build More: Empowering Agents with RL Toolkits and Infrastructure

Think Swift, Build More: Empowering Agents with RL Toolkits and Infrastructure

Register to See Address
San Jose, California
Registration
Event Full
If you’d like, you can join the waitlist.
Please click on the button below to join the waitlist. You will be notified if additional spots become available.
About Event

[In-person event in San Francisco] [50 spots only]

This event is cohosted by verl, SGLang, Zilliz and Creao AI and organized by Monolith Management. Together, we’ll explore the latest advances in RL, RL infrastructure, Reasoning, and Agentic AI

​We’ll open with several presentations and dig into:

  • verl Reinforcement Learning framework designed for efficient and flexible training of large-scale models

  • SGLang

    • Optimizing End2End Multi-turn RL with SGLang rollout

    • Tool uses a feature on SGLang with various tool parsers

    • SpecForge: A unified training framework for speculative decoding across LLMs, VLMs, and LoRAs

  • Zilliz Unlocking billion-scale AI search with Milvus for massive unstructured data

  • Creao AI Building tools and infrastructure for code agent

After the presentations, stick around for drinks, snacks, and deep-cut conversations with other engineers and researchers from OpenAI, Anthropic, MSL, xAI, and more

About Co-hosts & Speakers

verl: An open-source RL framework for LLMs, enabling efficient training across large-scale distributed systems with 300+ contributors and 12k+ stars on Github

  • Haibin Lin works on foundation models at Bytedance Seed, focusing on optimizing training infra for LLMs & multimodal models (with more than 10k GPUs), from pre-training to post-training. Prior to Bytedance, he works on Apache MXNet.

  • Ziheng Jiang works on LLM systems at Bytedance, focusing on scaling and optimizing LLM training and inference. His research work spans projects such as MegaScale (with 10K+ GPUs), MegaScale-Infer and Apache TVM.

SGLang: An open-source, fast serving and programming framework for LLMs and VLMs with 600+ contributors and 17k+ stars on Github

  • Chenyang Zhao, a Ph.D. student in Computer Science at UCLA. A proud member of the lmsys.org community and the leader of the SGLang RL team. Focus on building and optimizing Post-training Pipelines and Agentic RL Systems

  • Shuai Shi, Lead ML engineer in Cloudsway.AI, SpecForge maintainer, SGLang Core contributor, focus on MLLM inference and RL

Zilliz / Milvus: An open-source high-performance vector database powering large-scale AI search across text, images, and multi-modal data with 300+ contributors and 36k+ stars on Github

  • Jiang Chen is the Head of Developer Relations at Zilliz, the creator of Milvus open-source vector database. Previously, he was a tech lead and product manager at Google, leading the development of web-scale semantic understanding and search indexing

Creao AI: Building the foundational agentic infrastructure for a world where autonomous agents actively work on our behalf

  • Peter Pang, Co-founder and CTO of Creao AI, former Apple AI Scientist and Meta GenAI tech lead, is now pioneering AI-native platforms at his startup, advancing the frontier of agentic AI where humans and agents work side by side.

About Organizer

Monolith Management takes its name from the enigmatic black monolith in the iconic sci-fi movie "2001: A Space odyssey.", symbolizing universal truths with its 1:4:9 ratio—the squares of the first three natural numbers. At Monolith Management, we embody curiosity, resilience, and wisdom, blending detached thinking with decisive action. We aim to stand shoulder-to-shoulder with entrepreneurs, merging visionary concepts with steadfast execution to jointly shape the future.

M-A-P is an open-source AI research community.The community members are working on research topics in a wide range of spectrum, including but not limited to pre-training paradigm of foundation models, large-scale data collection and processing, and the derived applciations on coding, reasoning and music creativity.

Who should attend:

For researchers, developers, technical builders and founders working with Agentic AI, Coding, Reasoning or RL Infrastructure


​Limited capacity. RSVP required.

This event is on-site only, no recording available. If you cannot make it, please cancel registration

Location
Please register to see the exact location of this event.
San Jose, California