

Private Event
Reading Group & Discussion: The assistant axis: situating and stabilizing the character of large language models
Registration
Past Event
About Event
We'll be reading and discussing: The assistant axis: situating and stabilizing the character of large language models
Session Structure:
18:00-18:10: Introductions - please arrive on time!
18:10-18:50: silent paper reading
18:50-19:30: group discussion
For those who prefer to play around to see activation-capping in action, feel free to play around with: https://neuronpedia.org/assistant-axis
This paper has a complementary article, so if you'd prefer a lighter read take a look here: https://www.anthropic.com/research/assistant-axis
This is a private event. If there is someone who you think would be a good fit for our community, please share this link with them.