Which Distribution Distances are Sublinearly Testable?
From MaRDI portal
Publication:4608070
zbMATH Open1409.62043arXiv1708.00002MaRDI QIDQ4608070
Author name not available (Why is that?)
Publication date: 15 March 2018
Abstract: Given samples from an unknown distribution and a description of a distribution , are and close or far? This question of "identity testing" has received significant attention in the case of testing whether and are equal or far in total variation distance. However, in recent work, the following questions have been been critical to solving problems at the frontiers of distribution testing: -Alternative Distances: Can we test whether and are far in other distances, say Hellinger? -Tolerance: Can we test when and are close, rather than equal? And if so, close in which distances? Motivated by these questions, we characterize the complexity of distribution testing under a variety of distances, including total variation, , Hellinger, Kullback-Leibler, and . For each pair of distances and , we study the complexity of testing if and are close in versus far in , with a focus on identifying which problems allow strongly sublinear testers (i.e., those with complexity for some where is the size of the support of the distributions and ). We provide matching upper and lower bounds for each case. We also study these questions in the case where we only have samples from (equivalence testing), showing qualitative differences from identity testing in terms of when tolerance can be achieved. Our algorithms fall into the classical paradigm of -statistics, but require crucial changes to handle the challenges introduced by each distance we consider. Finally, we survey other recent results in an attempt to serve as a reference for the complexity of various distribution testing problems.
Full work available at URL: https://arxiv.org/abs/1708.00002
Nonparametric hypothesis testing (62G10) Analysis of algorithms and problem complexity (68Q25) Characterization and structure theory of statistical distributions (62E10)
Related Items (2)
Hypothesis testing for high-dimensional multinomials: a selective review โฎ Analysis of COVID-19 evolution based on testing closeness of sequential data
Recommendations
- Title not available (Why is that?) ๐ ๐
- Is submodularity testable? ๐ ๐
- Distribution-free testing for monomials with a sublinear number of queries ๐ ๐
- Sublinear algorithms for testing monotone and unimodal distributions ๐ ๐
- Distribution-Free Testing Algorithms for Monomials with a Sublinear Number of Queries ๐ ๐
- Distribution-free tests of subhypotheses ๐ ๐
- Near-Optimal Closeness Testing of Discrete Histogram Distributions ๐ ๐
- Distribution Testing Lower Bounds via Reductions from Communication Complexity ๐ ๐
- Optimal Algorithms for Testing Closeness of Discrete Distributions ๐ ๐
- Testing Closeness of Discrete Distributions ๐ ๐
This page was built for publication: Which Distribution Distances are Sublinearly Testable?
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4608070)