We’re building the workforce needed to safely navigate AGI.   

Contact: team@bluedot.org

BlueDot Impact

​Continuing our theme of recursive self-improvement, Mark Keavney will present 

RE-bench: Evaluating frontier AI R&D capabilities of language model agents against human experts

​Every week, someone will present for up to 20 minutes followed by 40 minutes of discussion. RSVP to join, 

, or contact us at evalsreadinggroup@gmail.com with questions. Everyone is welcome!

AI Safety Evals - Paper Reading Club