Agents Behaving Badly: The Perils of Pushing AI Agents into Production

Name: Agents Behaving Badly: The Perils of Pushing AI Agents into Production
Start: 2026-06-25T18:00:00.000-04:00
End: 2026-06-25T20:00:00.000-04:00
Location: New York, NY

Arklex AI

Register to See Address

New York, NY

Registration Closed

This event is not currently taking registrations. You may contact the host or subscribe to receive updates.

About Event

As companies move beyond prototypes and pilots, launching AI agents in production, evaluation becomes one of the hardest unsolved problems.

How do you measure reliability, catch regressions, and build trust in systems that are non-deterministic and increasingly autonomous? How do you prevent embarrassing drift? How do you stop your customer support agent from offering discounts on products that don't exist, or going completely off-script?

That's the core tension we want to explore.

Speakers:

Kilian Lieret, PhD - AI Research Scientist, Meta Superintelligence
Zhou Yu - Co-Founder & CEO of Arklex.AI & CS Professor at Columbia University

(Full speaker lineup to be announced)

Agenda:

5 to 10 min: Welcome and framing
30 min: Moderated panel with live audience Q&A
30 min: Networking

Who should attend:

Whether you're an engineer building agent systems, a product leader deciding where to deploy AI, or an executive navigating the risks of putting AI agents in front of customers, this event is for you.

Food and beverages provided

Space is limited to 50 attendees.

Location

Please register to see the exact location of this event.

New York, NY

Presented by

Arklex AI

AI agent evaluation & testing

Hosted By

72 Went

AI