CiteULike is a free online bibliography manager. Register and you can start organising your references online.

Genetic algorithms for simultaneous variable and sample selection in metabonomics. Export

Bioinformatics (Oxford, England) (14 November 2008)

Citation Format

[Posts]

View FullText article


gulkur's tags for this article

algorithms application genetic

X Reviews [Write a review of this article]

X Notes for this article

gulkur has 1 private note and 0 public notes for this article. If you are gulkur then you can log in to see the private note.

X Find related articles from these CiteULike users

X Find related articles with these CiteULike tags

X Posting History

X Abstract

MOTIVATION: Metabolic profiles derived from high resolution (1)H-NMR data are complex, therefore statistical and machine learning approaches are vital for extracting useful information and biological insights. Focused modelling on targeted subsets of metabolites and samples can improve the predictive ability of models, and techniques such as genetic algorithms (GAs) have a proven utility in feature selection problems. The Consortium for Metabonomic Toxicology (COMET) obtained temporal NMR spectra of urine from rats treated with model toxins and stressors. Here we develop a GA approach which simultaneously selects sets of samples and spectral regions from the COMET database to build robust, predictive classifiers of liver and kidney toxicity. RESULTS: The results indicate that using simultaneous sample and variable selection improved performance by over 9% compared with either method alone. Simultaneous selection also halved computation time. Successful classifiers repeatedly selected particular variables indicating that this approach can aid defining biomarkers of toxicity. Novel visualisations of the results from multiple computations were developed to aid the interpretability of which samples and variables were frequently selected. This method provides an efficient way to determine the most discriminatory variables and samples for any post-genomic dataset. AVAILABILITY: GA code available from http://www1.imperial.ac.uk/medicine/people/r.cavill/ CONTACT: r.cavill@imperial.ac.uk, t.ebbels@imperial.ac.uk.


X BibTeX record

X RIS record


Privacy Statement | Terms & Conditions
CiteULike organises scholarly (or academic) papers or literature and provides bibliographic (which means it makes bibliographies) for universities and higher education establishments. It helps undergraduates and postgraduates. People studying for PhDs or in postdoctoral (postdoc) positions. The service is similar in scope to EndNote or RefWorks or any other reference manager like BibTeX, but it is a social bookmarking service for scientists and humanities researchers.