Beyond Benchmarks: Ethics and AI Models in the Real World

Name: Beyond Benchmarks: Ethics and AI Models in the Real World
Start: 2025-10-14T18:00:00.000-04:00
End: 2025-10-14T20:00:00.000-04:00
Location: 2112 Pennsylvania Ave NW

DC Data & AI Events

2112 Pennsylvania Ave NW

Washington, District of Columbia

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

This is a cross post of https://www.meetup.com/dc-nlp/events/311125097/ - no need to sign up both places. 🙂

Join us for our Responsible AI talk with Professor Patrick Hall! Big thanks to DC-NLP for organizing an MCing and Prefect for providing, space, food, and drinks!

Agenda:

6:00 - 6:30 PM - Welcome and mingle
6:30- 6:45 PM - Introductions
6:45 - 7:30 PM - Talk
7:30 - 8:00 PM - Wrap up

Description:

Benchmarks are useful, but it’s common sense that they can’t tell us how AI behaves in the real world. Worse, they encourage proxy games and number-chasing (Goodhart’s Law) and can be distorted by task or data contamination. What truly matters are in-situ outcomes and failure modes—privacy leaks, biased or unsafe behavior, misinformation cascades—which static leaderboards rarely reveal.

The answer isn’t to abandon benchmarks—they're too valuable for developers—but to extend measurement beyond them: combine model-centric tests with structured red-teaming and user-driven field-testing, then apply context-aware measurement instruments to judge real impact.

This talk unpacks the limitations of benchmarks and evals, and offers constructive steps to move past proxy games toward evidence of whether systems work safely, fairly, and reliably where it counts—the real world.

Location

2112 Pennsylvania Ave NW

Washington, DC 20037, USA

Head to the front desk to check in and then up to the second floor via the elevators.

Presented by

DC Data & AI Events

Washington D.C. Data & AI Events! Managed by Data Community DC. www.dc2.org

Hosted By

198 Went

AI