Tight Bounds for Hashing Block Sources
From MaRDI portal
Publication:3541807
DOI10.1007/978-3-540-85363-3_29zbMATH Open1159.68637arXiv0806.1948OpenAlexW2119574607MaRDI QIDQ3541807
Kai-Min Chung, Salil P. Vadhan
Publication date: 27 November 2008
Published in: Lecture Notes in Computer Science (Search for Journal in Brave)
Abstract: It is known that if a 2-universal hash function is applied to elements of a {em block source} , where each item has enough min-entropy conditioned on the previous items, then the output distribution will be ``close to the uniform distribution. We provide improved bounds on how much min-entropy per item is required for this to hold, both when we ask that the output be close to uniform in statistical distance and when we only ask that it be statistically close to a distribution with small collision probability. In both cases, we reduce the dependence of the min-entropy on the number of items from in previous work to , which we show to be optimal. This leads to corresponding improvements to the recent results of Mitzenmacher and Vadhan (SODA `08) on the analysis of hashing-based algorithms and data structures when the data items come from a block source.
Full work available at URL: https://arxiv.org/abs/0806.1948
Related Items (3)
On the effective block size in Harper's theorem ⋮ Optimal block size for set-valued attributes ⋮ Distinguishing Distributions Using Chernoff Information
This page was built for publication: Tight Bounds for Hashing Block Sources