Distribution Compression in Near-linear Time

Author name not available

Publication date: 15 November 2021

Abstract: In distribution compression, one aims to accurately summarize a probability distribution $\mathbb{P}$ using a small number of representative points. Near-optimal thinning procedures achieve this goal by sampling $n$ points from a Markov chain and identifying $\sqrt{n}$ points with $\widetilde{\mathcal{O}}(1/\sqrt{n})$ discrepancy to $\mathbb{P}$. Unfortunately, these algorithms suffer from quadratic or super-quadratic runtime in the sample size $n$. To address this deficiency, we introduce Compress++, a simple meta-procedure for speeding up any thinning algorithm while suffering at most a factor of $4$ in error. When combined with the quadratic-time kernel halving and kernel thinning algorithms of Dwivedi and Mackey (2021), Compress++ delivers $\sqrt{n}$ points with $\mathcal{O}(\sqrt{\log n / n})$ integration error and better-than-Monte-Carlo maximum mean discrepancy in $\mathcal{O}(n \log^{3} n)$ time and $\mathcal{O}(\sqrt{n} \log^{2} n)$ space. Moreover, Compress++ enjoys the same near-linear runtime given any quadratic-time input and reduces the runtime of super-quadratic algorithms by a square-root factor. In our benchmarks with high-dimensional Monte Carlo samples and Markov chains targeting challenging differential equation posteriors, Compress++ matches or nearly matches the accuracy of its input algorithm in orders of magnitude less time.
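
To make the idea of wrapping a halving routine in a faster meta-procedure concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation and not the API of the companion repository. The recursive split-into-quarters structure, the oversampling parameter `g`, the final repeated-halving step, and all function names (`compress`, `compress_pp`, `random_halve`, `mmd`) are assumptions introduced here. The placeholder `random_halve` keeps one point per pair at random, so it only demonstrates the bookkeeping and gives no better-than-Monte-Carlo guarantee. A kernel-based halving routine would be substituted in practice.

```python
import numpy as np

def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian kernel matrix between the rows of x and the rows of y."""
    sq = (np.sum(x**2, axis=1)[:, None]
          + np.sum(y**2, axis=1)[None, :]
          - 2.0 * x @ y.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def mmd(x, y, bandwidth=1.0):
    """Empirical (V-statistic) maximum mean discrepancy between x and y."""
    kxx = gaussian_kernel(x, x, bandwidth).mean()
    kyy = gaussian_kernel(y, y, bandwidth).mean()
    kxy = gaussian_kernel(x, y, bandwidth).mean()
    return float(np.sqrt(max(kxx + kyy - 2.0 * kxy, 0.0)))

def random_halve(points, rng):
    """Placeholder halving rule (assumption): keep one point of each
    consecutive pair, chosen uniformly at random."""
    pairs = points.reshape(len(points) // 2, 2, -1)
    pick = rng.integers(0, 2, size=len(pairs))
    return pairs[np.arange(len(pairs)), pick]

def compress(points, halve, g, rng):
    """Recursively reduce n = 4**k points to 2**g * sqrt(n) points
    (hypothetical recursion: split in four, recurse, merge, halve)."""
    n = len(points)
    if n <= 4**g:
        return points
    quarters = np.array_split(points, 4)
    merged = np.concatenate([compress(q, halve, g, rng) for q in quarters])
    return halve(merged, rng)

def compress_pp(points, halve, g, rng):
    """Sketch of the full pipeline: compress, then halve g more times
    to reach sqrt(n) points (the final step is an assumption)."""
    coreset = compress(points, halve, g, rng)
    for _ in range(g):
        coreset = halve(coreset, rng)
    return coreset

rng = np.random.default_rng(0)
sample = rng.standard_normal((4**5, 2))   # n = 1024 points in 2 dimensions
coreset = compress_pp(sample, random_halve, g=2, rng=rng)
print(coreset.shape)                      # (32, 2), i.e. sqrt(n) points
print(mmd(sample, coreset))               # discrepancy of the coreset to the sample
```

In this sketch the halving routine is only ever called on merged sets of size $2^{g+1}\sqrt{m}$ for a subproblem of size $m$, never on all $m$ points at once, which is where a saving over running a quadratic-time routine on the full sample would come from.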

Has companion code repository: https://github.com/microsoft/goodpoints


MaRDI item: Q6383107