Cover Image for AI Safety Technical Course 3 - RLHF: PPO & Agents
Cover Image for AI Safety Technical Course 3 - RLHF: PPO & Agents
Avatar for SAIN Utrecht - Events
14 Going

AI Safety Technical Course 3 - RLHF: PPO & Agents

Registration
Welcome! Please choose your desired ticket type:
About Event

This lecture will cover PPO & Agents.

In this lecture:

  • Introduction to PPO (Proximal Policy Optimization) as a reinforcement learning algorithm and its advantages

  • PPO agent - defining the environment, policy and objectives

  • Learning phase - understanding how the agent improves through feedback

  • Training loop - putting everything together into a full RL training pipeline

  • Intoduction to RLHF: Connecting PPO to modern AI systems

To receive the certificate, you must complete the notebooks and attend this lecture, as in-person attendance is mandatory.

However, we will stream the lecture online for those who cannot attend in person.

Google Meet Link:
meet.google.com/kyu-utay-qia

Drift 23 is accessible through the library.

We'll be serving pizzas and snacks during the lecture.

After the lecture, you're invited to join us for drinks on the house.

Course material is inspired by ARENA, leading UK program in AI Safety.

Location
Drift 23, room 113
3512 BR Utrecht, Netherlands
Avatar for SAIN Utrecht - Events
14 Going