Presented by
Galileo Events
Calendar of events for AI evaluation company Galileo
284 Going

Eval Engineering for AI app developers - Lesson 1: Hello Evals!

YouTube
About Event

Learn Eval Engineering in this free, 5-part, hands-on course.

90% of AI agents never make it to production. The biggest reason is that the AI engineers building these apps lack a clear way to evaluate whether their agents are doing what they should, and to use the results of that evaluation to fix them.

In this course, you will learn all about evals for AI applications. You'll start with some out-of-the-box metrics and the basics of evals, then move on to observability for AI apps, analyzing failure states, and defining custom metrics, and finally apply all of these across your whole software development life cycle (SDLC).

This will be hands-on, so be prepared to write some code, create some metrics, and do some homework!

In this first lesson, you will:

  • Learn what evals are

  • Learn how you can use simple evals to detect issues in an AI application

  • Get hands-on experience adding an eval to an app
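
To give a rough feel for what a "simple eval" can look like before the lesson, here is a minimal Python sketch. It is an illustration only, not course material and not Galileo's API: `fake_app`, `contains_expected`, `run_evals`, and the test cases are all hypothetical names and data, and the canned responses stand in for a real model call.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalResult:
    prompt: str
    response: str
    passed: bool


def contains_expected(response: str, expected: str) -> bool:
    """Pass when the expected phrase appears in the response, ignoring case."""
    return expected.lower() in response.lower()


def run_evals(app: Callable[[str], str], cases: list[tuple[str, str]]) -> list[EvalResult]:
    """Run every (prompt, expected phrase) case through the app and score it."""
    results = []
    for prompt, expected in cases:
        response = app(prompt)
        results.append(EvalResult(prompt, response, contains_expected(response, expected)))
    return results


def fake_app(prompt: str) -> str:
    """Hypothetical stand-in for an AI app; a real app would call a model here."""
    canned = {
        "What is the capital of France?": "The capital of France is Paris.",
        "What is 2 + 2?": "I am not sure.",
    }
    return canned.get(prompt, "")


# Hypothetical test cases: a prompt and a phrase a good answer should contain.
cases = [
    ("What is the capital of France?", "Paris"),
    ("What is 2 + 2?", "4"),
]

results = run_evals(fake_app, cases)
pass_rate = sum(r.passed for r in results) / len(results)

for r in results:
    print(("PASS" if r.passed else "FAIL"), "-", r.prompt)
print("pass rate:", pass_rate)
```

The second case fails, which is the point: even a crude check like this surfaces a response that does not do what it should. Keyword matching is only one possible metric, chosen here because it needs no API key or external service.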

Prerequisites:

  • A basic knowledge of Python

  • Access to an OpenAI API key

  • A free Galileo account (we will be using Galileo as the evals platform)

Future lessons

Lesson 2: https://luma.com/vmcrtnkx
Lesson 3: https://luma.com/3k99shl1
Lesson 4: https://luma.com/x2ztpa4f
Lesson 5: https://luma.com/esoi6izo
