

Evaluating Conversational AI Agents - Opik Virtual Learning Series
Registration
About Event
👏 Learn how to implement evals for conversational AI Agents.
This is a hands-on workshop series from Comet led by Claire Longo. In this session, we will guide you through an interactive coding session, discuss LLM eval best practices, and facilitate live Q&A.
If you're building AI Agents and passionate about quality, this is the session for you!!
In this talk, we will show how to implement a practical workflow for building and evaluating conversational Agents using Comet Opik. We will demonstrate how to log traces, annotate sessions with expert insights, and design LLM-as-a-Judge metrics that mimic human reasoning, turning domain expertise into a repeatable feedback loop.