Mathematics of diffusion models
November 6-7th, 2025

Overview

In this workshop the goal is to understand as much as possible diffusion models from the point of view of mathematics. We will have a short course of 3 hours the first day in the afternoon, and a practical session the second day in the morning.

Registration

Registration is free but mandatory. To register please fill the form before Friday, the 24th of October here.

Venue

FRUMAM, St Charles, salle de séminaire étage 2
Aix-Marseille Université
3, place Victor Hugo - MARSEILLE Cedex 03

Program

November, 6th 13.00:13:30		Coffee and introduction
November, 6th 13:30-16:30	C. Boyer	Course: mechanisms and challenges in Score-Based Learning Abstract.
Abstract We will focus on score-based generative models, which rely on diffusion equations. If necessary, we will begin with a brief review of stochastic calculus in order to properly introduce and handle the mathematical objects involved. Once the models are presented, we will discuss the typical generalization bounds obtained, before turning to more recent developments that aim to shed light on favorable mechanisms arising during score learning. In particular, we will examine the phenomenon of memorization and how it can be mitigated by using sufficiently large steps in the SGD strategy employed to learn the score. Finally, we will highlight the connection between the diffusion stopping time and the underlying structure of the data.
November, 7th 9:00 -12:00	E. Claeys	Practical session: diffusion models from theory to reality Abstract.
Abstract Diffusion models are a powerful framework for generative modeling, with applications ranging from image, audio or tabular data synthesis to LLM, reinforcement learning and beyond. But moving from elegant theory to practical implementation raises some key questions: how do you build, load and customise a model that truly fits your problem? How can you generate highly specific data? How can you detect and evaluate biases in existing models? In this seminar, we’ll explore practical guidelines for selecting and implementing diffusion models, balancing trade-offs such as computation time, cost, trustworthiness and task efficiency. We’ll also dive into bias in generative AI, showing how revealing and measuring bias can be a challenging task, especially for large language models (LLMs). A hands-on case study will illustrate the strengths and limitations of the diffusion-model approach, sparking discussion on what it really takes to bring diffusion models from theory to real-world applications. A practical session will be conducted in Python.

Confirmed speakers

Claire Boyer (Institut de mathématiques d'Orsay, Université Paris-Saclay)
Emmanuelle Claeys (Institut de Recherche en Informatique de Toulouse, Université Paul Sabatier, Toulouse)

Organizers

Sandrine Anthoine.
Caroline Chaux.
Clothilde Mélot.