Cover Image for ShockLab Seminar: Similarity as a Signal: Do Al Agents Cooperate More When They Know They're Alike?
Cover Image for ShockLab Seminar: Similarity as a Signal: Do Al Agents Cooperate More When They Know They're Alike?
Avatar for AI Safety South Africa
5 Went
Private Event

ShockLab Seminar: Similarity as a Signal: Do Al Agents Cooperate More When They Know They're Alike?

Register to See Address
Cape Town, South Africa
Registration
Past Event
Welcome! To join the event, please register below.
About Event

Abstract

The Nash equilibrium for the Prisoner's Dilemma is to defect. Always. But here's a thought: what if you knew the coplayer across from you thought about the world the same way you do? Would you still defect? That's the question we're trying to answer - except instead of people, we're using Al agents. I'll share some early findings from ongoing experiments, a few things that surprised us, and plenty of open questions we haven't resolved yet. Thoughts and feedback very welcome

Bio

Akash Kundu is a final-year Computer Science undergraduate and Cooperative Al Research Fellow with experience in technical Al Safety, focusing on evaluating and stress-testing large language models. His work has uncovered behavioural failures across a range of dimensions - including dark patterns, sycophancy, harmful reasoning, and multilingual vulnerabilities. He has has collaborated with Apart Research, FAR Al, and Humane Intelligence on evaluation pipelines, adversarial prompting, and cross-cultural red-teaming.

---

Housekeeping:

Location
Please register to see the exact location of this event.
Cape Town, South Africa
Avatar for AI Safety South Africa
5 Went