Cover Image for AI Safety Evals - Paper Reading Club
Cover Image for AI Safety Evals - Paper Reading Club
Avatar for BlueDot Impact Events
We’re building the workforce needed to safely navigate AGI. Contact: [email protected]

AI Safety Evals - Paper Reading Club

Zoom
Registration
Welcome! To join the event, please register below.
About Event

We are reading:

Chain-of-Thought Reasoning In The Wild Is Not Always Faithful
https://arxiv.org/abs/2503.08679

Author Iván Arcuschin will be joining us for a talk and discussion!

Every week, someone will present for up to 20 minutes followed by 40 minutes of discussion. RSVP to join, check our schedule, volunteer to present, pick one paper from our suggested list or propose your own.

Note: Formerly scheduled, but delayed to next week:
CoT May Be Highly Informative Despite “Unfaithfulness”

https://metr.org/blog/2025-08-08-cot-may-be-highly-informative-despite-unfaithfulness/

Avatar for BlueDot Impact Events
We’re building the workforce needed to safely navigate AGI. Contact: [email protected]