

Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]
AI Safety Evals - Paper Reading Club
Past Event
About Event
We are reading:
Alignment faking in large language models
https://arxiv.org/abs/2412.14093
Every week, someone presents for up to 20 minutes, followed by 40 minutes of discussion. RSVP to join, or volunteer to present: pick a paper from our suggested list or propose your own.