Stein Thinning


Optimally thinning of output from a sampling procedure, such as MCMC. Here the red samples are automatically chosen by Stein Thinning to provide a more accurate approximation to the distributional target, compared with the original MCMC output. [Read more]

View the Project on GitHub wilson-ye-chen/stein_thinning_start


Stein Thinning is a tool for post-processing the output of a sampling procedure, such as Markov chain Monte Carlo (MCMC). It aims to minimise a Stein discrepancy, selecting a subsequence of samples that best represent the distributional target.

The user provides two arrays: one containing the samples and another containing the corresponding gradients of the log-target. Stein Thinning returns a vector of indices, indicating which samples were selected.

In favourable circumstances, Stein Thinning is able to:


Implementations of Stein Thinning are currently available for Python and MATLAB:

Get Started

First, it is important to parametrise the distributional target so that it has a positive and differentiable density formula on formula.

In Python, MATLAB, or R, it takes a single function call to start Stein Thinning:

indices = thin(samples, gradients, m)


Stan Examples

Stein Thinning can be used to post-process the output directly from the Stan family of probabilistic programming languages:

Citing Stein Thinning

Riabiz M, Chen WY, Cockayne J, Swietach P, Niederer SA, Mackey L, Oates CJ (2020) Optimal Thinning of MCMC Output. arXiv

Research Based on Stein Thinning

Teymur O, Gorham J, Riabiz M, Oates CJ (2021) Optimal Quantisation of Probability Measures Using Maximum Mean Discrepancy. International Conference on Artificial Intelligence and Statistics (AISTATS 2021). Paper