

Maximizing the value of reinforcement learning
Despite decades of research in RL on sample efficiency and generalization, the scaling of RL currently centers on policy gradient methods that often eschew not only these prior innovations, but even value functions. Which aspects of the pre-LLM RL field are most promising to revisit? Is there a place for temporal-difference methods in the transformer era of RL?
Join Vmax and SPC for a panel discussion featuring Danijar Hafner (creator of Dreamer), Ashvin Nair (ex-OpenAI, now RL foundations at Cursor), and Nate Rahn (research scientist at Anthropic) on which parts of pre-LLM reinforcement learning are likely to be the most fruitful in the LLM era.
Schedule:
5:00pm - doors open
5:30pm - 6:30pm panel discussion
6:30pm - 7:30pm drinks and networking
Who should attend: Researchers and engineers interested in reinforcement learning.