Cover Image for Reading Group & Discussion: Reasoning Models Struggle to Control their Chains of Thought
Cover Image for Reading Group & Discussion: Reasoning Models Struggle to Control their Chains of Thought
Avatar for AI Safety South Africa
Hosted By
5 Went
Private Event

Reading Group & Discussion: Reasoning Models Struggle to Control their Chains of Thought

Register to See Address
Cape Town, South Africa
Registration
Past Event
Welcome! To join the event, please register below.
About Event

We'll be reading and discussing: Reasoning Models Struggle to Control their Chains of Thought

The paper asks a simple but important safety question: can reasoning models deliberately hide or reshape what they say in their chain-of-thought, and the authors find that they currently struggle to do that much more than they struggle to control their final answer.

Session Structure:

  • 18:00-18:10: Introductions - please arrive on time!

  • 18:10-18:50: silent paper reading

  • 18:50-19:30: group discussion

This is a private event. If there is someone who you think would be a good fit for our community, please share this link with them.

Location
Please register to see the exact location of this event.
Cape Town, South Africa
Avatar for AI Safety South Africa
Hosted By
5 Went