Over-optimism in bioinformatics: an illustration

by: Monika Jelizarow, Vincent Guillemot, Arthur Tenenhaus, Korbinian Strimmer, Anne-Laure Boulesteix
Bioinformatics, Vol. 26, No. 16. (15 August 2010), pp. 1990-1998, doi:10.1093/bioinformatics/btq323  Key: citeulike:7363255

Abstract

Motivation: In statistical bioinformatics research, different optimization mechanisms potentially lead to ‘over-optimism’ in published papers. So far, however, a systematic critical study concerning the various sources underlying this over-optimism is lacking.

Results: We present an empirical study on over-optimism using high-dimensional classification as example. Specifically, we consider a ‘promising’ new classification algorithm, namely linear discriminant analysis incorporating prior knowledge on gene functional groups through an appropriate shrinkage of the within-group covariance matrix. While this approach yields poor results in terms of error rate, we quantitatively demonstrate that it can artificially seem superior to existing approaches if we ‘fish for significance’. The investigated sources of over-optimism include the optimization of datasets, of settings, of competing methods and, most importantly, of the method's characteristics. We conclude that, if the improvement of a quantitative criterion such as the error rate is the main contribution of a paper, the superiority of new algorithms should always be demonstrated on independent validation data.

Availability: The R codes and relevant data can be downloaded from http://www.ibe.med.uni-muenchen.de/organisation/mitarbeiter/020_professuren/boulesteix/overoptimism/, such that the study is completely reproducible.

Contact: boulesteix@ibe.med.uni-muenchen.de
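The selection effect described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' R code and uses no gene expression data or discriminant analysis; it only assumes a set of hypothetical method variants that all have the same true error rate (0.5, i.e. random guessing). Reporting the lowest error observed among the variants on one dataset looks like an improvement, while the error on independent validation data stays near the true rate. All names and parameter values (`n_variants`, `n_samples`, `n_repeats`) are illustrative assumptions.

```python
import random


def observed_error(n_samples, true_error, rng):
    """Observed error rate of a classifier whose true error rate is `true_error`."""
    mistakes = sum(rng.random() < true_error for _ in range(n_samples))
    return mistakes / n_samples


def simulate(n_variants=50, n_samples=40, true_error=0.5, n_repeats=500, seed=0):
    """Average the 'best variant' error and an independent validation error."""
    rng = random.Random(seed)
    selected_sum = 0.0
    validation_sum = 0.0
    for _ in range(n_repeats):
        # Every variant is equally useless; differences are pure sampling noise.
        errors = [observed_error(n_samples, true_error, rng)
                  for _ in range(n_variants)]
        # 'Fishing for significance': report only the best-looking variant.
        selected_sum += min(errors)
        # Independent validation data: the chosen variant has no real advantage,
        # so its error is a fresh draw at the same true error rate.
        validation_sum += observed_error(n_samples, true_error, rng)
    return selected_sum / n_repeats, validation_sum / n_repeats


optimistic, honest = simulate()
print(f"mean error of the selected variant on the tuning data: {optimistic:.3f}")
print(f"mean error on independent validation data:            {honest:.3f}")
```

The gap between the two printed values grows with the number of variants tried and shrinks with the sample size, which is consistent with the abstract's recommendation to demonstrate superiority on independent validation data.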

