

Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]
AI Safety Evals - Paper Reading Club
Registration
Past Event
About Event
Continuing our theme of recursive self-improvement, Mark Keavney will present RE-bench: Evaluating frontier AI R&D capabilities of language model agents against human experts.
Every week, someone will present for up to 20 minutes followed by 40 minutes of discussion. RSVP to join, sign up to present, or contact us at [email protected] with questions. Everyone is welcome!
Presented by
BlueDot Impact
We’re building the workforce needed to safely navigate AGI.
Contact: [email protected]