Cover Image for Community Paper Reading: From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production
Cover Image for Community Paper Reading: From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production
Avatar for Arize AI
Presented by
Arize AI
Generative AI-focused workshops, hackathons, and more. Come build with us!

Community Paper Reading: From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production

Zoom
Registration
Past Event
Welcome! To join the event, please register below.
About Event

Join our upcoming community paper reading, where we'll dive into the latest paper from a team of researchers at IBM: "From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production."

We're excited to host several of the paper's authors, who will walk us through the research and its implications. There will be a live Q&A session, so bring your questions!

The paper reports IBM’s experience developing and piloting the Computer Using Generalist Agent (CUGA), which has been open-sourced for the community. CUGA adopts a hierarchical planner–executor architecture with strong analytical foundations, achieving state-of-the-art performance on AppWorld and WebArena. Beyond benchmarks, it was evaluated in a pilot within the Business-Process-Outsourcing talent acquisition domain, addressing enterprise requirements for scalability, auditability, safety, and governance.

Avatar for Arize AI
Presented by
Arize AI
Generative AI-focused workshops, hackathons, and more. Come build with us!