Cover Image for Adversarial Defenses for LLMs

Presented by

Trajectory Labs

Catalyzing Toronto's role in steering AI progress toward a future of human flourishing.

Join us for a variety of events on technical AI safety, governance in a world of advanced AI, and more.

Hosted By

52 Went

AI

Adversarial Defenses for LLMs

Name: Adversarial Defenses for LLMs
Start: 2026-03-26T18:00:00.000-04:00
End: 2026-03-26T21:00:00.000-04:00
Location: 30 Adelaide St E

Trajectory Labs

30 Adelaide St E

Toronto, Canada

Past Event

Welcome! Please choose your desired ticket type:

You will be asked to verify token ownership with your wallet.

About Event

In his talk, Samuel Simko from ETH Zurich will present his recent work on adversarial defenses for LLMs, developed with the Jinesis Lab (University of Toronto). The talk will cover a series of approaches, ranging from triplet-based contrastive learning defenses to honeypot-style defenses designed to avoid worst-case behavior. He will also discuss patterns observed in contest-winning manual jailbreaking prompts, ideas for tamper-resistant safeguards, and the current limits of attacks, defenses, and evaluation methodologies.

Location

30 Adelaide St E

Toronto, ON M5C 3G8, Canada

Enter the main lobby of the building and let the security staff know you are here for the AI event. You may need to show your RSVP on your phone. You will be directed to the 12th floor where the meetup is held. If you have trouble getting in, give Georgia a call at 519-981-0360.

Presented by

Trajectory Labs

Catalyzing Toronto's role in steering AI progress toward a future of human flourishing.

Join us for a variety of events on technical AI safety, governance in a world of advanced AI, and more.

Hosted By

52 Went

AI