Cover Image for AI Safety Evals - Paper Reading Club
Cover Image for AI Safety Evals - Paper Reading Club
Avatar for BlueDot Impact
Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]

AI Safety Evals - Paper Reading Club

Zoom
Registration
Past Event
Welcome! To join the event, please register below.
About Event

An author presentation this week: Morgan Sinclaire will present his paper When can we trust untrusted monitoring? A safety case sketch across collusion strategies. This is very recent research following up on the foundational control paper discussed on April 7. Please come to both!

​Every week, someone will present for up to 20 minutes followed by 40 minutes of discussion. RSVP to join, sign up to present, or contact us at [email protected] with questions. Everyone is welcome!

Avatar for BlueDot Impact
Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]