Cover Image for CAIRF Seminar Series: You can't study game theoretic dynamics if you can't compute equilibria | Eugene Vinitsky
Cover Image for CAIRF Seminar Series: You can't study game theoretic dynamics if you can't compute equilibria | Eugene Vinitsky
Avatar for AI Safety South Africa
Hosted By
8 Went
Private Event

CAIRF Seminar Series: You can't study game theoretic dynamics if you can't compute equilibria | Eugene Vinitsky

Register to See Address
Cape Town, South Africa
Registration
Past Event
Welcome! To join the event, please register below.
About Event

In the past decade, motivated by the putative failure of naive self-play deep reinforcement learning (DRL) in adversarial imperfect-information games, researchers have developed numerous DRL algorithms based on fictitious play (FP), double oracle (DO), and counterfactual regret minimization (CFR). In light of recent results of the magnetic mirror descent algorithm, we hypothesize that simpler generic policy gradient methods like PPO are competitive with or superior to these FP-, DO-, and CFR-based DRL approaches. To facilitate the resolution of this hypothesis, we implement and release the first broadly accessible exact exploitability computations for four large games. Using these games, we conduct the largest-ever exploitability comparison of DRL algorithms for imperfect-information games. Over 5600 training runs, we find that FP-, DO-, and CFR-based approaches fail to outperform generic policy gradient methods. Code is available at https://github.com/nathanlct/IIG-RL-Benchmark and https://github.com/gabrfarina/exp-a-spiel

See Eugene Vinitsky's website here.



About the event's hybrid nature:

The speaker (Eugene) will be calling in for the talk and we will have an in-person audience as well as the option to dial in remotely.

Note that we have limited capacity for this event so register early to ensure you get a spot.

Location
Please register to see the exact location of this event.
Cape Town, South Africa
Avatar for AI Safety South Africa
Hosted By
8 Went