Avatar for AI Safety South Africa
6 Went
Private Event

Reading Group & Discussion: The assistant axis: situating and stabilizing the character of large language models

Register to See Address
Cape Town, South Africa
Registration
Past Event
About Event

We'll be reading and discussing the paper "The assistant axis: situating and stabilizing the character of large language models".

Session Structure:

  • 18:00-18:10: Introductions (please arrive on time!)

  • 18:10-18:50: Silent paper reading

  • 18:50-19:30: Group discussion

If you'd rather take a hands-on approach and see activation capping in action, feel free to experiment with: https://neuronpedia.org/assistant-axis

The paper has a companion article, so if you'd prefer a lighter read, take a look here: https://www.anthropic.com/research/assistant-axis

This is a private event. If you know someone who would be a good fit for our community, please share this link with them.
