Cover Image for Demystifying Video Reasoning by Ruisi Wang
Cover Image for Demystifying Video Reasoning by Ruisi Wang
Avatar for Video Model Journal Club
Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.
Hosted By
17 Going

Demystifying Video Reasoning by Ruisi Wang

Zoom
Registration
Welcome! To join the event, please register below.
About Event

Abstract: Recent advances in video generation have revealed an unexpected phenomenon: diffusion-based video models exhibit non-trivial reasoning capabilities. We challenge the Chain-of-Frames assumption and uncover a fundamentally different mechanism — Chain-of-Steps (CoS), where reasoning emerges along the diffusion denoising steps. We identify several emergent reasoning behaviors: working memory, self-correction, and perception before action.

Speaker: Ruisi Wang — Researcher with a background in computer science from Nanyang Technological University, working on computer vision, video reasoning, and spatial intelligence.

Website: https://journal.video-reason.com/ To join over zoom, please subscribe to get zoom link: https://forms.gle/ebgyvtLRz8ABTfdX6

Avatar for Video Model Journal Club
Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.
Hosted By
17 Going