Personal

Aditya Iyer

A weekly reading group for anyone interested in mechanistic interpretability research. Our goal is to read papers from the mech interp community, share ideas, and deepen our understanding of how neural networks work under the hood.

Each week, we pick a paper to read and discuss together. The paper for the week will typically be posted on Monday, giving everyone time to read ahead.

Everyone is welcome to present a paper or suggest one for a future session. If you have paper suggestions or want to present, reach out to adityaiyer.m@gmail.com — all levels of familiarity with the field are welcome!

Edit: For our first meeting, we'll be reading: 

Mechanistic Interpretability (ML) Reading Group

Lokesh

Tina Togo