AI Evals w/ Alex Gu: Evaluating AI Systems on Mathematical and Coding Tasks
About Event
AI Evals on AlphaXiv
Wednesday, November 5th 2025 · 11 AM PT
Featuring Alex Gu
Moderated Discussion + Q&A
AI Evals Series: Evaluating AI Systems on Mathematical and Coding Tasks
We're excited to host Alex Gu, a PhD student at MIT whose research focuses on evaluating and improving AI systems on programming and mathematical reasoning. In this session, Alex will share insights from his work on widely used benchmarks and tools, such as LiveCodeBench, LeanDojo, IneqMath, CruxEval, and more. He'll also discuss how these evaluations inform our understanding of AI capabilities, and explore the future of training and assessing AI models on math and code tasks.
This event is virtual. The Zoom link will be shared upon registration. The talk will later be uploaded to AlphaXiv's YouTube channel.
Hosted by: alphaXiv x Vals AI
AI Evals: join the community
