

Presented by
BlueDot Impact
AI Safety Evals - Paper Reading Club
Registration
About Event
Another author presentation! Àlex Serrano Terré will present his work related to our situational awareness theme: Neural Chameleons: Language Models Can Learn to Hide Their Thoughts from Unseen Activation Monitors.
Every week, someone will present for up to 20 minutes followed by 40 minutes of discussion. RSVP to join, sign up to present, or contact us at [email protected] with questions. Everyone is welcome!
Presented by
BlueDot Impact