Name: Hack your way into LLMs
Start: 2026-05-09T10:00:00.000+02:00
End: 2026-05-10T19:30:00.000+02:00
Location: EPFL

Safe AI Lausanne

Can you read a model's mind before it writes the exploit?

SAIL is hosting a mechanistic interpretability hackathon at EPFL. A curated adversarial dataset. Five coding models. One question: what's actually happening inside when they decide to help an attacker?

 Claude Code credits, GPU instances, mentors, and food all weekend. A real shot at a publishable result in 48 hours.

PyTorch fluency + curiosity about model internals. That's the bar.

Hack your way into LLMs

Tommy Chu

Mohsen Akoum

Vinayak Joshi