ShockLab Seminar: Strategic Vagueness in LLMS

AI Safety South Africa

Register to See Address

Cape Town, South Africa

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Abstract

This work investigates whether LLMs understand and deploy strategic vagueness, with implications for Al safety. Steven Pinker's theory of indirect speech posits that humans use ambiguous language as a rational strategy when facing audiences with conflicting interests enabling coordination with allies while maintaining plausible deniability against adversaries. We investigate whether LLMs exhibit similar capabilities: can they produce language that evades a trusted monitor while successfully coordinating with an accomplice?

Bio

Pramod Kaushik is a research associate in FBK Italy and a student researcher at the University of Trento. Previously, he has worked at TRDDC Pune, Inrio Bordeaux and Columbia Uni. His recent work on the theory of LLM Sampling won the best paper award at ACL last year. He has previously worked on decision making agents and has previously worked on understanding human decision making and building neurocomputational models of the brain.

---

Housekeeping:

Join the Shocklab events public Google calendar to see upcoming events
You can find selected past events at shocklab.net/seminars.
Sign up to speak here

Location

Please register to see the exact location of this event.

Cape Town, South Africa

Presented by

AI Safety South Africa

Hosted By

2 Went