Data Mining as an Industry

The Review of Economics and Statistics, Vol. 67, No. 1. (1985), pp. 124-127.

lionicebear's tags for this article

data_mining

Abstract

"Data mining" by an individual investigator can distort the probabilities in conventional significance tests. This paper argues that the same effect can occur when a given data set is used by more than one investigator, even if no individual investigator engages in data mining. A problem of publication selection bias is recalled and note is taken of its implications for the interpretation of published test results when there is collective data mining. Some illustrative calculations of probabilities associated with collective data mining are provided.
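The kind of probability the abstract refers to can be illustrated with a minimal sketch. It is not taken from the paper: it assumes each investigator runs one test at the same nominal level on the shared data set, that the null hypothesis is true, and that the tests are statistically independent (a simplification, since tests on a common data set are usually correlated). The function name `prob_at_least_one_rejection` is hypothetical.

```python
def prob_at_least_one_rejection(alpha: float, n_tests: int) -> float:
    """Probability that at least one of n_tests independent tests rejects
    a true null hypothesis when each is run at nominal level alpha.

    Assumption (not from the paper): the tests are independent, so the
    chance that none rejects is (1 - alpha) ** n_tests.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if n_tests < 0:
        raise ValueError("n_tests must be non-negative")
    return 1.0 - (1.0 - alpha) ** n_tests


# Illustrative values for a nominal 5% test and a growing number of investigators.
for n in (1, 5, 10, 20):
    print(n, round(prob_at_least_one_rejection(0.05, n), 3))
```

Under these assumptions, with 20 investigators each testing at the 5% level, the chance that at least one reports a "significant" result is roughly 64%, even though no single investigator has mined the data. If only significant results are published, the published record overstates the evidence in the same way.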
