

AI by Hand ✍️ Seminar: Frontier (PPO➝DPO➝GRPO➝Rubrics)
From PPO to DPO to GRPO to the RL frontier—Rubrics!
Guest Expert
Cameron R. Wolfe, Senior Research Scientist at Netflix, Author of the Deep Learning Focus newsletter (60K+ subscribers).
Preview
PPO
DPO
GRPO
Rubrics-based Rewards
February Seminars
2/5: Meta AI
Guest Expert: Yichen Wang (Meta)
2/12: Transformer
Guest Expert: Srijanie Dey, Tsavorite Scalable Intelligence
2/19: OpenClaw
Guest Expert: Val Andrei Fajardo, LlamaIndex Founding ML Engineer, Author of the Build a Multi-Agent System From Scratch
2/26: PPO➝DPO➝GRPO➝Rubrics
Guest Expert: Cameron R. Wolfe, Netflix, Author of Deep Learning Focus
January Seminars
1/29/2026: 9 AI Eval Formulas
1/22/2026: TPU
1/15/2026: How Small Models Learn Tool Use from AWS
1/8/2026: Manifold-Constrained Hyper Connection (mHC) from DeepSeek
1/8:2026: Introduction to Generative AI
About the Seminar Series
In 2026, I’ve made a personal commitment to teach one live AI seminar every week by hand ✍️.
Foundation: intuition, math, and mental models of core AI concepts, for beginners.
Frontier: research papers, algorithm, and architectures used by frontier models, for advanced AI engineers and researchers.
Live attendance is free.
The recordings and Excel workbooks are available for members of the AI by Hand Academy.