Cover Image for Evaluating Conversational AI Agents - Opik Virtual Learning Series

Presented by

Comet provides an end-to-end model evaluation platform for AI developers, with best in class LLM evaluations, experiment tracking, and production monitoring

Hosted By

61 Going

AI

Evaluating Conversational AI Agents - Opik Virtual Learning Series

Comet

Zoom

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

👏 Learn how to implement evals for conversational AI Agents.

This is a hands-on workshop series from Comet led by Claire Longo. In this session, we will guide you through an interactive coding session, discuss LLM eval best practices, and facilitate live Q&A.

If you're building AI Agents and passionate about quality, this is the session for you!!

In this talk, we will show how to implement a practical workflow for building and evaluating conversational Agents using Comet Opik. We will demonstrate how to log traces, annotate sessions with expert insights, and design LLM-as-a-Judge metrics that mimic human reasoning, turning domain expertise into a repeatable feedback loop.

Presented by

Comet

Comet provides an end-to-end model evaluation platform for AI developers, with best in class LLM evaluations, experiment tracking, and production monitoring

Hosted By

61 Going

AI