Cover Image for Calendar
Every week we pick one paper and go deep — video generation, world models, physical reasoning, diffusion, flow matching, and everything in between.

Events

You have 0 events pending approval by the calendar admin.
They will show up on the schedule once approved
 
Cover Image for Can Models Think Without Language? Video as the Next Substrate of Intelligence by Zhongang Cai

Can Models Think Without Language? Video as the Next Substrate of Intelligence by Zhongang Cai

By Hokin Deng
Zoom
+43
 
Cover Image for Demystifying Video Reasoning by Ruisi Wang

Demystifying Video Reasoning by Ruisi Wang

By Hokin Deng
Zoom
+12
 
Cover Image for Think Visually, Reason Textually: Vision-Language Synergy in ARC by Beichen Zhang

Think Visually, Reason Textually: Vision-Language Synergy in ARC by Beichen Zhang

By Hokin Deng
Zoom
+4
 
Cover Image for Video Models Can Reason with Verifiable Rewards by Tinghui Zhu

Video Models Can Reason with Verifiable Rewards by Tinghui Zhu

By Hokin Deng
Zoom
+5
 
Cover Image for POSTPONED to July 24: Embodied Reasoning with World Models by Yilun Du

POSTPONED to July 24: Embodied Reasoning with World Models by Yilun Du

By Hokin Deng, Fan-Yun Sun, Charlotte Xia, Shin & 2 others
San Francisco, CA
 
Cover Image for Video Models Are Zero-Shot Learners and Reasoners by Thaddäus Wiedemer

Video Models Are Zero-Shot Learners and Reasoners by Thaddäus Wiedemer

By Hokin Deng
Zoom
+5
 
Cover Image for VChain: Chain-of-Visual-Thought for Reasoning in Video Generation by Ziqi Huang

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation by Ziqi Huang

By Hokin Deng
To Be Announced
 
Cover Image for Do Joint Audio-Video Generation Models Understand Physics? by Zijun Cui

Do Joint Audio-Video Generation Models Understand Physics? by Zijun Cui

By Hokin Deng
Zoom