CiteULike is a free online bibliography manager. Register and you can start organising your references online.

Always Good Turing: Asymptotically Optimal Probability Estimation Export

FOCS

Citation Format

[Posts]

View FullText article


yalding's tags for this article

cryptography estimator probability turing

X Reviews [Write a review of this article]

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History

X Abstract

While deciphering the Enigma Code during World War II, I.J. Good and A.M. Turing considered the problem of estimating a probability distribution from a sample of data. They derived a surprising and un- intuitive formula that has since been used in a va- riety of applications and studied by a number of re- searchers. Borrowing an information-theoretic and machine-learning framework, we de¯ne the attenua- tion of a probability estimator as the largest possible ratio between the per-symbol probability assigned to an arbitrarily-long sequence by any distribution, and the corresponding probability assigned by the estima- tor. We show that some common estimators have in¯- nite attenuation and that the attenuation of the Good- Turing estimator is low, yet larger than one. We then derive an estimator whose attenuation is one, namely, as the length of any sequence increases, the per-symbol probability assigned by the estimator is as high as pos- sible. Interestingly, some of the proofs use celebrated results by Hardy and Ramanujan on the number of partitions of an integer. To better understand the be- havior of the estimator, we study the probability it as- signs to several simple sequences. We show that for some sequences this probability agrees with our intu- ition, while for others it is rather unexpected.


X BibTeX record

X RIS record


Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.