![]() |
CiteULike | ![]() |
dswan's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
TargetMiner: MicroRNA target prediction with systematic identification of tissue specific negative examples |
Reviews
[Write a review of this article]
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractMotivation: Prediction of microRNA (miRNA) target mRNAs using machine learning approaches is an important area of research. However, most of the methods suffer from either high false positive or false negative rates. One reason for this is the marked deficiency of negative examples or miRNA-non target pairs. Systematic identification of non-target mRNAs is still not addressed properly, and therefore, current machine learning approaches are compelled to rely on artificially generated negative examples for training. Results: In this paper we have identified [~]300 tissue specific negative examples using a novel approach that involves expression profiling of both miRNAs and mRNAs, miRNA-mRNA structural interactions and seed site conservation. The newly generated negative examples are validated with pSILAC data set (Selbach et al., 2008) that elucidate the fact that the identified non-targets are indeed non-targets.These high throughput tissue specific negative examples and a set of experimentally verified positive examples are then used to build a system called TargetMiner, a support vector machine (SVM) based classifier. In addition to assessing the prediction accuracy on cross-validation experiments, TargetMiner has been validated with a completely independent experimental test data set. Our method outperforms 10 existing target prediction algorithms and provides a good balance between sensitivity and specificity that is not reflected in the existing methods. We achieve a significantly higher sensitivity and specificity of 69% and 67.8% based on a pool of 90 feature set and 76.5% and 66.1% using a set of 30 selected feature set on the completely independent test data set. In order to establish the effectiveness of the systematically generated negative examples, the SVM is trained using a different set of negative data generated using the method in (Yousef et al., 2007). A significantly higher false positive rate (70.6%) is observed when tested on the independent set, while all other factors are kept the same. Again, when an existing method (NBmiRTar) is executed with the our proposed negative data, we observe an improvement in its performance. These clearly establish the effectiveness of the proposed approach of selecting the negative examples systematically. Availability: TargetMiner is now available as an online tool at www.isical.ac.in/~bioinfo_miu Supplementarymaterials: Supplementarymaterials are available at www.isical.ac.in/~bioinfo_miu/Download.html Contact: sanghami@isical.ac.in, rmitra_t@isical.ac.in 10.1093/bioinformatics/btp503
BibTeX record
RIS record