Debjyoti Paul - Softmax, Backprop, and the Autograd: The Hidden Machinery Behind Deep Learning

ML Math - Cohere Labs Open Science Community

Google Meet

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Deep learning did not become powerful because neural networks were invented in 2010. The ideas were already there for decades. What changed was the perfect alignment of **softmax, cross-entropy, reverse-mode autodiff, GPUs, optimizers, and scale**.

This tells the mathematical and algorithmic story behind that alignment.

We begin from first principles: why neural networks produce raw scores called logits, how softmax converts those scores into probabilities, and why the softmax + cross-entropy pair gives one of the cleanest gradients in machine learning.

From there, we go deeper into the machinery of learning itself: backpropagation, reverse-mode automatic differentiation, vector-Jacobian products, computational graphs, and why PyTorch’s .backward() is so central to modern AI. We contrast this with forward-mode autodiff and explain why backward-mode is the right fit for deep learning’s core shape

The story also places deep learning in historical context. We compare it with the older worlds of **SVMs, kernel methods, and XGBoost**, showing why margin-based learning and tree boosting dominated many problems before deep representation learning became practical.

Finally, we connect the 1990s to the 2010s: what changed, why GPUs mattered, how autograd frameworks changed research culture, why softmax became central not only in classification but also in attention, and how these pieces together created the modern deep learning machine

Bio:
Debjyoti is a Data Scientist at Amazon with over 9 years of industrial experience in Natural Language Processing (NLP), Large Language Models (LLMs), and Agentic AI and Responsible AI, Currently Debjyoti is Leading Agentic development and actively working on Agent learning primarily focusing on Agentic System improvement from Context Engineering to RL based Learning framework. Prior to this Debjyoti has led AI-driven solutions in enterprise applications focusing on Anomaly Detection, Recommendation system, NLP, Information extraction and Computer Vision. Their research expertise spans AI ethical AI governance, bias mitigation, and scalable LLM deployment, model interpretability ensuring responsible AI adoption across industries. With hands-on experience in developing production-grade AI systems, they actively research fairness, robustness, and transparency in AI, contributing to frameworks that enhance trust and accountability in AI-driven decision-making.

Presented by

ML Math - Cohere Labs Open Science Community

Led by Katrina Lawrence and Neel Ghoshal. Part of the Cohere Labs Open Science initiative https://cohere.com/research/open-science

Hosted By