Deep Dives May 10, 2024 15 min read
Mathematical Intuition behind Diffusion Models
At their core, diffusion models are about destroying and reconstructing information. We take an image and slowly add Gaussian noise until it is pure static. The model learns the reverse process: how to take static and denoise it step-by-step into a coherent image.
The Forward Process
We can model the forward diffusion process as a Markov chain. Let be the original image. At each step , we add a small amount of Gaussian noise:
Where is a variance schedule. As , approaches an isotropic Gaussian distribution .
The Reverse Process
The goal is to learn the reverse distribution . Since the exact reverse is intractable, we approximate it with a neural network (usually a U-Net):

This simple idea connects thermodynamics (non-equilibrium statistical physics) with modern deep learning.
Thanks for reading.