



π AI BENCHMARK CLUB
ββtl;dr Come join us for technical talks focusing on AI benchmarks. We're a group of AI engineers, researchers, and academics.
βGet ready for three incredible speakers / industry leaders:
- βMark Saroufim from GPU Mode / Meta 
- βGeorge Cameron from Artificial Analysis 
- βDaniel Kim from Cerebras 
βThis event is part of Open Source AI Week, right next to the PyTorch Conference venue.
βDetails
ββπ Date & Time: Wednesday, October 22 @ 5:30 PM
π Location: SF, near Moscone West.
(Food and drinks provided. ππΊ)
- βTalk 1 by Mark will showcase BackendBench, for evaluating LLM-generated kernels with a focus on correctness 
- βTalk 2 by George will cover Artificial Analysis's industry-shaping work in evaluating frontier models across many dimensions 
- βTalk 3 by Daniel will discuss work his team did evaluating REAP, a new pruning method for sparse Mixture-of-Experts models on the agentic SWE-Bench benchmark. 
βAgenda:
- ββ5:30 - 6: Arrival, Food, Networking 
- ββ6 - 7: Technical Talks, Q&A 
- ββ7 - 7:30: Post Talk Networking 
βAbout the Meetup
βAre you an engineer or researcher interested in measuring AI systems? This is a technical meetup where we dive into the guts of AI benchmarks of all stripes.
ββInterested in sharing AI benchmarking work you've done at a future session? Tell us about it.
