Cover Image for Demystifying Video Reasoning by Ruisi Wang

Presented by

Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.

Hosted By

17 Going

Demystifying Video Reasoning by Ruisi Wang

Video Model Journal Club

Zoom

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Abstract: Recent advances in video generation have revealed an unexpected phenomenon: diffusion-based video models exhibit non-trivial reasoning capabilities. We challenge the Chain-of-Frames assumption and uncover a fundamentally different mechanism — Chain-of-Steps (CoS), where reasoning emerges along the diffusion denoising steps. We identify several emergent reasoning behaviors: working memory, self-correction, and perception before action.

Speaker: Ruisi Wang — Researcher with a background in computer science from Nanyang Technological University, working on computer vision, video reasoning, and spatial intelligence.

Website: https://journal.video-reason.com/ To join over zoom, please subscribe to get zoom link: https://forms.gle/ebgyvtLRz8ABTfdX6

Presented by

Video Model Journal Club

Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.

Hosted By

17 Going