Kernel Thinning


arXiv: 2105.05842
MaRDI QID: Q6367524

Author name not available

Publication date: 12 May 2021

Abstract: We introduce kernel thinning, a new procedure for compressing a distribution $\mathbb{P}$ more effectively than i.i.d. sampling or standard thinning. Given a suitable reproducing kernel $\mathbf{k}^{\star}$ and $\mathcal{O}(n^2)$ time, kernel thinning compresses an $n$-point approximation to $\mathbb{P}$ into a $\sqrt{n}$-point approximation with comparable worst-case integration error across the associated reproducing kernel Hilbert space. The maximum discrepancy in integration error is $\mathcal{O}_d(n^{-1/2}\sqrt{\log n})$ in probability for compactly supported $\mathbb{P}$ and $\mathcal{O}_d(n^{-1/2}(\log n)^{(d+1)/2}\sqrt{\log\log n})$ for sub-exponential $\mathbb{P}$ on $\mathbb{R}^d$. In contrast, an equal-sized i.i.d. sample from $\mathbb{P}$ suffers $\Omega(n^{-1/4})$ integration error. Our sub-exponential guarantees resemble the classical quasi-Monte Carlo error rates for uniform $\mathbb{P}$ on $[0,1]^d$ but apply to general distributions on $\mathbb{R}^d$ and a wide range of common kernels. Moreover, the same construction delivers near-optimal $L^\infty$ coresets in $\mathcal{O}(n^2)$ time. We use our results to derive explicit non-asymptotic maximum mean discrepancy bounds for Gaussian, Matérn, and B-spline kernels and present two vignettes illustrating the practical benefits of kernel thinning over i.i.d. sampling and standard Markov chain Monte Carlo thinning, in dimensions $d=2$ through $100$.
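
To make the compression step concrete, the following is a minimal NumPy sketch of repeated kernel halving followed by a maximum mean discrepancy (MMD) comparison against an equal-sized i.i.d. subsample. It uses a Gaussian kernel and a fixed self-balancing-walk threshold, whereas the paper's KT-SPLIT/KT-SWAP procedure uses adaptively chosen thresholds and a final swap refinement; every name and parameter below (gauss_kernel, kernel_halving, kernel_thin, mmd, sigma, a) is an illustrative assumption, not the authors' implementation.

# Illustrative sketch only: a simplified kernel-halving loop in the spirit of
# kernel thinning, not the authors' KT-SPLIT/KT-SWAP algorithm or API.
import numpy as np


def gauss_kernel(X, Y, sigma=1.0):
    # Gaussian kernel matrix k(x, y) = exp(-||x - y||^2 / (2 sigma^2)).
    sq = np.sum(X**2, 1)[:, None] + np.sum(Y**2, 1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * sigma**2))


def kernel_halving(X, sigma=1.0, a=1.0, rng=None):
    # Keep one point from each consecutive pair, choosing signs that steer the
    # signed RKHS discrepancy toward zero (a simplified self-balancing walk
    # with a fixed threshold a; the paper uses adaptive thresholds).
    rng = np.random.default_rng(rng)
    n = len(X) - (len(X) % 2)              # ignore a trailing unpaired point
    K = gauss_kernel(X[:n], X[:n], sigma)
    c = np.zeros(n)                        # coefficients of the running discrepancy psi
    kept = []
    for t in range(0, n, 2):
        f = K[:, t] - K[:, t + 1]          # evaluations of f_t = k(x_t, .) - k(x_{t+1}, .)
        alpha = c @ f                      # <psi, f_t> in the RKHS
        p_plus = np.clip(0.5 * (1.0 - alpha / a), 0.0, 1.0)
        eta = 1.0 if rng.random() < p_plus else -1.0
        c[t], c[t + 1] = eta, -eta         # psi <- psi + eta * f_t
        kept.append(t if eta > 0 else t + 1)
    return X[kept]


def kernel_thin(X, sigma=1.0, seed=None):
    # Halve repeatedly until roughly sqrt(n) points remain.
    rng = np.random.default_rng(seed)
    target = int(np.ceil(np.sqrt(len(X))))
    S = X
    while len(S) > target:
        S = kernel_halving(S, sigma=sigma, rng=rng)
    return S


def mmd(X, S, sigma=1.0):
    # Maximum mean discrepancy between the empirical distributions of X and S.
    kxx = gauss_kernel(X, X, sigma).mean()
    kxs = gauss_kernel(X, S, sigma).mean()
    kss = gauss_kernel(S, S, sigma).mean()
    return np.sqrt(max(kxx - 2.0 * kxs + kss, 0.0))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.standard_normal((1024, 2))     # n = 1024 points in d = 2
    S_kt = kernel_thin(X, sigma=1.0, seed=1)
    S_iid = X[rng.choice(len(X), size=len(S_kt), replace=False)]
    print("coreset size:", len(S_kt))
    print("MMD of kernel-thinned coreset:", mmd(X, S_kt))
    print("MMD of i.i.d. subsample:      ", mmd(X, S_iid))

The fixed threshold a is the main simplification here; the paper's adaptively chosen thresholds are what underpin the $\mathcal{O}_d(n^{-1/2})$-type guarantees quoted above.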




Has companion code repository: https://github.com/microsoft/goodpoints
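
For orientation, a hedged usage sketch against the companion repository follows. The module path, entry point, and parameters (goodpoints.kt.thin, split_kernel, swap_kernel, delta, seed) are recalled from the repository's documentation and should be treated as assumptions; consult the README at the URL above for the authoritative interface.

# Hedged sketch only: treat goodpoints.kt.thin and its parameters as assumed;
# verify against https://github.com/microsoft/goodpoints before relying on it.
import numpy as np
from goodpoints import kt   # assumed package name: pip install goodpoints

rng = np.random.default_rng(0)
X = rng.standard_normal((1024, 2))   # n = 1024 input points in d = 2

def gauss(y, X, sigma=1.0):
    # Assumed kernel-function signature: evaluations k(y, x_i) for each row x_i of X.
    return np.exp(-np.sum((X - y) ** 2, axis=1) / (2.0 * sigma ** 2))

# m halving rounds thin n points to n / 2^m; here m = log4(1024) = 5 leaves
# roughly sqrt(n) = 32 points.
coreset = kt.thin(X, 5, split_kernel=gauss, swap_kernel=gauss, delta=0.5, seed=123)
print(coreset)   # assumed to be the indices of the retained points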








