CiteULike is a free online bibliography manager. Register and you can start organising your references online.
Tags

Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies

by: Miao-Xin Li, Johnny S. H. Kwan, Su-Ying Bao, Wanling Yang, Shu-Leong Ho, Yong-Qiang Song, Pak C. Sham
PLoS Genet, Vol. 9, No. 1. (17 January 2013), e1003143, doi:10.1371/journal.pgen.1003143  Key: citeulike:11914495

Formatted Citation


Show HTML

Likes (beta)

This copy of the article hasn't been liked by anyone yet.

View FullText article


Abstract

Exome sequencing is becoming a standard tool for mapping Mendelian disease-causing (or pathogenic) non-synonymous single nucleotide variants (nsSNVs). Minor allele frequency (MAF) filtering approach and functional prediction methods are commonly used to identify candidate pathogenic mutations in these studies. Combining multiple functional prediction methods may increase accuracy in prediction. Here, we propose to use a logit model to combine multiple prediction methods and compute an unbiased probability of a rare variant being pathogenic. Also, for the first time we assess the predictive power of seven prediction methods (including SIFT, PolyPhen2, CONDEL, and logit) in predicting pathogenic nsSNVs from other rare variants, which reflects the situation after MAF filtering is done in exome-sequencing studies. We found that a logit model combining all or some original prediction methods outperforms other methods examined, but is unable to discriminate between autosomal dominant and autosomal recessive disease mutations. Finally, based on the predictions of the logit model, we estimate that an individual has around 5% of rare nsSNVs that are pathogenic and carries ~22 pathogenic derived alleles at least, which if made homozygous by consanguineous marriages may lead to recessive diseases. Sequencing the coding regions of the human genome is becoming a standard approach in identifying causal genes for human Mendelian diseases. Researchers often rely on multiple functional prediction methods/tools to separate the candidate causal mutation(s) from other rare mutations in these studies. In this paper, we propose the use of a statistical model to combine prediction scores from multiple methods and to estimate the chance of a rare mutation being Mendelian disease-causing (or pathogenic). We found that our model using all or some individual prediction methods consistently outperforms other prediction methods examined and could exclude more than 55% of rare non-pathogenic mutations in an individual genome. Unfortunately, no method was able to discriminate between autosomal dominant and autosomal recessive disease mutations. In addition, based on the predictions of our model, we estimated that a person can carry ~22 pathogenic derived alleles at least, which if present at the same position in the genome may lead to Mendelian diseases.


dswan's tags for this article

Citations (CiTO)

No CiTO relationships defined

X There are no reviews yet

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History


X Export records

Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.