Deprecated: $wgMWOAuthSharedUserIDs=false is deprecated, set $wgMWOAuthSharedUserIDs=true, $wgMWOAuthSharedUserSource='local' instead [Called from MediaWiki\HookContainer\HookContainer::run in /var/www/html/w/includes/HookContainer/HookContainer.php at line 135] in /var/www/html/w/includes/Debug/MWDebug.php on line 372
Which Distribution Distances are Sublinearly Testable? - MaRDI portal

Which Distribution Distances are Sublinearly Testable?

From MaRDI portal
Publication:4608070

zbMATH Open1409.62043arXiv1708.00002MaRDI QIDQ4608070

Author name not available (Why is that?)

Publication date: 15 March 2018

Abstract: Given samples from an unknown distribution p and a description of a distribution q, are p and q close or far? This question of "identity testing" has received significant attention in the case of testing whether p and q are equal or far in total variation distance. However, in recent work, the following questions have been been critical to solving problems at the frontiers of distribution testing: -Alternative Distances: Can we test whether p and q are far in other distances, say Hellinger? -Tolerance: Can we test when p and q are close, rather than equal? And if so, close in which distances? Motivated by these questions, we characterize the complexity of distribution testing under a variety of distances, including total variation, ell2, Hellinger, Kullback-Leibler, and chi2. For each pair of distances d1 and d2, we study the complexity of testing if p and q are close in d1 versus far in d2, with a focus on identifying which problems allow strongly sublinear testers (i.e., those with complexity O(n1gamma) for some gamma>0 where n is the size of the support of the distributions p and q). We provide matching upper and lower bounds for each case. We also study these questions in the case where we only have samples from q (equivalence testing), showing qualitative differences from identity testing in terms of when tolerance can be achieved. Our algorithms fall into the classical paradigm of chi2-statistics, but require crucial changes to handle the challenges introduced by each distance we consider. Finally, we survey other recent results in an attempt to serve as a reference for the complexity of various distribution testing problems.


Full work available at URL: https://arxiv.org/abs/1708.00002






Related Items (2)


Recommendations





This page was built for publication: Which Distribution Distances are Sublinearly Testable?

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4608070)