

π AI BENCHMARK CLUB
ββtl;dr Come join us for technical talks focusing on AI benchmarks. We're a group of AI engineers, researchers, and academics.
βGet ready for three incredible speakers / industry leaders:
βMark Saroufim from GPU Mode / Meta
βGeorge Cameron from Artificial Analysis
βDaniel Kim from Cerebras
βThis event is part of Open Source AI Week, right next to the PyTorch Conference venue.
βDetails
ββπ Date & Time: Wednesday, October 22 @ 5:30 PM
π Location: SF, near Moscone West.
(Food and drinks provided. ππΊ)
βTalk 1 by Mark will showcase BackendBench, for evaluating LLM-generated kernels with a focus on correctness
βTalk 2 by George will cover Artificial Analysis's industry-shaping work in evaluating frontier models across many dimensions
βTalk 3 by Daniel will discuss work his team did evaluating REAP, a new pruning method for sparse Mixture-of-Experts models on the agentic SWE-Bench benchmark.
βAgenda:
ββ5:30 - 6: Arrival, Food, Networking
ββ6 - 7: Technical Talks, Q&A
ββ7 - 7:30: Post Talk Networking
βAbout the Meetup
βAre you an engineer or researcher interested in measuring AI systems? This is a technical meetup where we dive into the guts of AI benchmarks of all stripes.
ββInterested in sharing AI benchmarking work you've done at a future session? Tell us about it.