Presented by
Arize AI
Generative AI-focused workshops, hackathons, and more. Come build with us!
115 Going

How To Navigate Binary vs Score Evals

Zoom
About Event

About a year ago, Arize AI released early research on how reliable foundation models are as LLM-as-a-Judge evaluators when asked for a binary label versus a numeric score. The results were clear: binary evals were the way to go. More than a year later, the models have improved. Does the research still hold?

In this session, Elizabeth Hutton (Senior AI Engineer) and Srilakshmi Chavali (AI Engineer) will dive into findings from newly released research!
