Cover Image for AI Manipulation Hackathon - Berlin
Cover Image for AI Manipulation Hackathon - Berlin
Avatar for AI Safety Berlin
Presented by
AI Safety Berlin
AI is shaping our future. Let's get it right. Learn more and connect with the community on aisafety.berlin
23 Went

AI Manipulation Hackathon - Berlin

Registration
Past Event
Welcome! To join the event, please register below.
About Event

Are you worried about future AI systems using deception, strategic behavior, or psychological exploitation to achieve their goals at the expense of human values and intentions?

Join other fellow Berliners in participating in Apart Research's AI Manipulation Hackathon. Whether you want to work alone, remotely with another team or join others locally to build a team, we'll be hosting a jam site for you to work and collaborate from.

Location
The Teamwork coworking space in Wedding is graciously hosting us this weekend. The address is Müllerstraße 138D, 13353 Berlin. You can find detailed instructions for getting to Teamwork here.

Food & Drinks
Friday night we'll be ordering pizza for dinner. Arrive at 17:00 to get fueled up while meeting others participating in the hackathon. Throughout the rest of the weekend, we'll be provided some food, snacks and drinks to keep you going.

Schedule
Friday, Jan 9

Saturday, Jan 10

Sunday, Jan 11

Prizes
The top teams will get:

  • $2,000 in cash prizes

  • The change to continue developing via Apart Research's Fellowship program

  • Guaranteed acceptance to present at the International Association for Safe & Ethical AI (IASEAI) workshop in Paris on February 26, 2026

Project Ideas
Projects can include:

  • Manipulation benchmarks that measure persuasive capabilities, deception, and strategic behavior with real ecological validity

  • Detection systems that identify sycophancy, reward hacking, sandbagging, and dark patterns in deployed AI systems

  • Real-world monitoring tools that analyze actual deployment data to catch manipulation in the wild

  • Evidence-based mitigations – MVPs demonstrating novel countermeasures with empirical backing

  • Multi-agent simulations exploring emergent manipulation dynamics and training processes that produce deceptive behavior

  • Pursue other empirical projects that advance our understanding of how AI systems manipulate and how we can stop them

Questions? Lost?

For questions or if you are lost, contact Trevor on Telegram @FastFedora.

Location
Teamwork
Müllerstraße 138D, 13353 Berlin, Germany
Ring “UES | Effektiv Spenden” at the gate. Detailed instructions for finding the office at https://docs.google.com/document/d/1WjAcbm2NUE0YIp…
Avatar for AI Safety Berlin
Presented by
AI Safety Berlin
AI is shaping our future. Let's get it right. Learn more and connect with the community on aisafety.berlin
23 Went