ShockLab Seminar: Similarity as a Signal: Do Al Agents Cooperate More When They Know They're Alike?

AI Safety South Africa

Register to See Address

Cape Town, South Africa

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Abstract

The Nash equilibrium for the Prisoner's Dilemma is to defect. Always. But here's a thought: what if you knew the coplayer across from you thought about the world the same way you do? Would you still defect? That's the question we're trying to answer - except instead of people, we're using Al agents. I'll share some early findings from ongoing experiments, a few things that surprised us, and plenty of open questions we haven't resolved yet. Thoughts and feedback very welcome

Bio

Akash Kundu is a final-year Computer Science undergraduate and Cooperative Al Research Fellow with experience in technical Al Safety, focusing on evaluating and stress-testing large language models. His work has uncovered behavioural failures across a range of dimensions - including dark patterns, sycophancy, harmful reasoning, and multilingual vulnerabilities. He has has collaborated with Apart Research, FAR Al, and Humane Intelligence on evaluation pipelines, adversarial prompting, and cross-cultural red-teaming.

---

Housekeeping:

Join the Shocklab events public Google calendar to see upcoming events
You can find selected past events at shocklab.net/seminars.
Sign up to speak here

Location

Please register to see the exact location of this event.

Cape Town, South Africa

Presented by

AI Safety South Africa

Hosted By

5 Went