

Mechanistic Interpretability (ML) Reading Group
Hosted by Aditya Iyer
Registration
About Event
A weekly reading group for anyone interested in mechanistic interpretability research. Our goal is to read papers from the mech interp community, share ideas, and deepen our understanding of how neural networks work under the hood.
Each week, we pick a paper to read and discuss together. The paper for the week will typically be posted on Monday, giving everyone time to read ahead.
Everyone is welcome to present a paper or suggest one for a future session. If you have paper suggestions or want to present, reach out to [email protected] — all levels of familiarity with the field are welcome!
Edit: For our first meeting, we'll be reading: https://arxiv.org/pdf/2305.00586