Probabilistic Counting Algorithms for Data Base Applications

Journal of Computer and System Sciences, Vol. 31, No. 2. (1985), pp. 182-209.

sfuniak's tags for this article

aggregation sketches

Abstract

This paper introduces a class of probabilistic counting algorithms with which one can estimate the number of distinct elements in a large collection of data (typically a large file stored on disk) in a single pass using only a small additional storage (typically less than a hundred binary words) and only a few operations per element scanned. The algorithms are based on statistical observations made on bits of hashed values of records. They are by construction totally insensitive to the...
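
The idea described in the abstract (one pass, a handful of machine words, decisions driven by bits of hashed records) can be illustrated with a minimal Python sketch. It is not taken from the paper: the hash function (SHA-1 truncated to 32 bits), the number of bitmaps, the helper names, and the correction constant 0.77351 are all assumptions made for this illustration.

```python
import hashlib

# Correction constant commonly quoted for this family of estimators.
# Treated as an assumption here; the abstract does not state it.
PHI = 0.77351


def _hash32(item: str) -> int:
    """Hypothetical helper: map a record to a 32-bit integer via SHA-1."""
    digest = hashlib.sha1(item.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big")


def _lowest_set_bit(x: int, width: int = 32) -> int:
    """Index of the least significant 1-bit of x (width - 1 if x is 0)."""
    if x == 0:
        return width - 1
    return (x & -x).bit_length() - 1


def estimate_distinct(items, num_maps: int = 64) -> float:
    """Estimate the number of distinct items in a single pass.

    Keeps num_maps small bitmaps. Each record is hashed, routed to one
    bitmap, and sets the bit at the position of the lowest 1-bit of the
    rest of its hash. The estimate is derived from the average position
    of the lowest unset bit across the bitmaps.
    """
    bitmaps = [0] * num_maps
    for item in items:
        h = _hash32(item)
        idx = h % num_maps        # which bitmap this record updates
        rest = h // num_maps      # remaining hash bits drive the bit pattern
        bitmaps[idx] |= 1 << _lowest_set_bit(rest)

    total = 0
    for bm in bitmaps:
        r = 0
        while bm & (1 << r):      # find the lowest bit still at zero
            r += 1
        total += r
    return num_maps / PHI * 2 ** (total / num_maps)
```

In this sketch a repeated record hashes to the same value and sets a bit that is already set, so replicating the input leaves the estimate unchanged. Using more bitmaps costs more storage and should reduce the variance of the estimate.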

