Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]

AI Safety Evals - Paper Reading Club

Zoom
Registration
Welcome! To join the event, please register below.
About Event

Join us for an author presentation! Evgenii Kortukov will present his paper on an unusual form of LLM deception: "Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs."

Every week, someone will present for up to 20 minutes, followed by 40 minutes of discussion. RSVP to join, sign up to present, or contact us at [email protected] with questions. Everyone is welcome!
