Cover Image for AI Safety Thursday: Modeling and Detecting Deceptive Alignment

Presented by

Catalyzing Toronto's role in steering AI progress toward a future of human flourishing. Join us for a variety of events on technical AI safety, governance in a world of advanced AI, and more.

Hosted By

1 Going

AI

AI Safety Thursday: Modeling and Detecting Deceptive Alignment

Name: AI Safety Thursday: Modeling and Detecting Deceptive Alignment
Start: 2025-10-16T18:00:00.000-04:00
End: 2025-10-16T21:00:00.000-04:00
Location: 30 Adelaide St E 12th floor

Trajectory Labs

30 Adelaide St E 12th floor

Toronto, Ontario

Ticket Price

CA$5.00

Welcome! To join the event, please get your ticket below.

You will be asked to verify token ownership with your wallet.

About Event

Annie Szorkin gives a talk on Modeling and Detecting Deceptive Alignment

Event Schedule
6:00 to 6:30 - Food and introductions
6:30 to 7:30 - Presentation and Q&A
7:30 to 9:00 - Open Discussions

If you can't attend in person, join our live stream starting at 6:30 pm via this link.

This is part of our weekly AI Safety Thursdays series. Join us in examining questions like:

How do we ensure AI systems are aligned with human interests?
How do we measure and mitigate potential risks from advanced AI systems?
What does safer AI development look like?

Location