AI Safety Technical Course 3 - RLHF: PPO & Agents

Name: AI Safety Technical Course 3 - RLHF: PPO & Agents
Start: 2026-06-03T18:00:00.000+02:00
End: 2026-06-03T19:00:00.000+02:00
Location: Drift 23, room 113

SAIN Utrecht - Events

About Event

This lecture will cover PPO & Agents.

In this lecture:

Introduction to PPO (Proximal Policy Optimization) as a reinforcement learning algorithm and its advantages
PPO agent - defining the environment, policy and objectives
Learning phase - understanding how the agent improves through feedback
Training loop - putting everything together into a full RL training pipeline
Introduction to RLHF: Connecting PPO to modern AI systems
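
As a small preview of the PPO topics listed above, here is a minimal sketch of the clipped surrogate objective that PPO is usually built around. It is not taken from the course notebooks: the function name `ppo_clip_loss`, its arguments, and the default `clip_eps=0.2` are assumptions chosen for illustration, and it uses plain Python lists rather than any particular deep learning library.

```python
import math

def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate loss over a batch (a value to minimize).

    new_logp / old_logp: log-probabilities of the taken actions under the
    current policy and the policy that collected the data (hypothetical inputs).
    advantages: advantage estimates for the same actions.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probs.
        ratio = math.exp(lp_new - lp_old)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    # Negate because the objective is maximized but a loss is minimized.
    return -total / len(advantages)
```

When the new policy makes a positively rewarded action much more likely, the clipped term caps how much credit the update receives, which is what keeps each policy update "proximal" to the previous policy.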

To receive the certificate, you must complete the notebooks and attend this lecture, as in-person attendance is mandatory.

However, we will stream the lecture online for those who cannot attend in person.

Google Meet Link:
meet.google.com/kyu-utay-qia

Drift 23 is accessible through the library.

We'll be serving pizzas and snacks during the lecture.

After the lecture, you're invited to join us for drinks on the house.

Course material is inspired by ARENA, a leading UK program in AI safety.

Location

Drift 23, room 113

3512 BR Utrecht, Netherlands

Presented by

SAIN Utrecht - Events
