Probabilistic counting algorithms for data base applications (Q1069325): Difference between revisions

This paper introduces a class of probabilistic counting algorithms with which one can estimate the number of distinct elements in a large collection of data (typically a large file stored on disk) in a single pass using only a small additional storage (typically less than a hundred binary words) and only a few operations per element scanned. The algorithms are based on statistical observations made on bits of hashed values of records. They are by construction totally insensitive to the replicative structure of elements in the file; they can be used in the context of distributed systems without any degradation of performances and prove especially useful in the context of data bases query optimisation.

0 references

zbMATH Keywords

number of distinct elements in a large collection of data

0 references

0 references

0 references

Approximate counting: a detailed analysis

0 references

Q4057549

0 references

Counting large numbers of events in small registers

0 references

Sorting and Searching in Multisets

0 references

Identifiers

zbMATH Open document ID

0583.68059

0 references

DOI

10.1016/0022-0000(85)90041-8

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

journals/jcss/FlajoletM85

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1069325

@@ description / en / description / en @@
-scientific article
+scientific article; zbMATH DE number 3934444
@@ Property / OpenAlex ID @@
+W2025051251
@@ Property / OpenAlex ID: W2025051251 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5799604
@@ Property / cites work: Q5799604 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate counting: a detailed analysis
@@ Property / cites work: Approximate counting: a detailed analysis / rank @@
+Normal rank
@@ Property / cites work @@
+Q4057549
@@ Property / cites work: Q4057549 / rank @@
+Normal rank
@@ Property / cites work @@
+Counting large numbers of events in small registers
+Normal rank
@@ Property / cites work @@
+Sorting and Searching in Multisets
@@ Property / cites work: Sorting and Searching in Multisets / rank @@
+Normal rank
@@ Property / DBLP publication ID @@
+journals/jcss/FlajoletM85
@@ Property / DBLP publication ID: journals/jcss/FlajoletM85 / rank @@
+Normal rank