Identification of a Preferred Set of Molecular Descriptors for Compound Classification Based on Principal Component Analysis

Journal of Chemical Information and Computer Sciences, Vol. 39, No. 4. (1 July 1999), pp. 699-704.


stharward's tags for this article

compound-classification feature-selection fingerprinting pca

Abstract

An algorithm based on principal component analysis was investigated to classify molecules in a database consisting of 455 compounds with activities against seven different biological targets. Diversity profiles of these compound sets were calculated and compared. To effectively classify compounds with similar biological activity, all possible combinations of 17 molecular descriptors were tested by complete factorial analysis, and preferred descriptor combinations were identified. High efficiency was achieved for a combination of a limited set of structural keys and two or three additional 2D descriptors. The performance of the approach was compared to Jarvis-Patrick clustering.
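
To give a sense of the size of the search described in the abstract, the following is a minimal Python sketch of exhaustively enumerating descriptor combinations. It is not the authors' implementation: the descriptor names (`d1` to `d17`), the helpers `descriptor_subsets` and `best_subset`, and the `score` callback are all hypothetical, and the abstract does not say how each combination was scored or whether the empty combination was counted.

```python
from itertools import combinations


def descriptor_subsets(descriptors):
    """Yield every non-empty combination of the given descriptors."""
    for size in range(1, len(descriptors) + 1):
        yield from combinations(descriptors, size)


def best_subset(descriptors, score):
    """Return the highest-scoring combination (first one found on ties).

    `score` is a hypothetical callback that maps a tuple of descriptor
    names to a number, e.g. a classification accuracy obtained after
    projecting the compounds onto principal components.
    """
    return max(descriptor_subsets(descriptors), key=score)


# Placeholder names: the abstract does not list the 17 descriptors.
descriptors = [f"d{i}" for i in range(1, 18)]

# Count the non-empty combinations: 2**17 - 1.
n_subsets = sum(1 for _ in descriptor_subsets(descriptors))
print(n_subsets)
```

With 17 descriptors there are 2^17 - 1 = 131,071 non-empty combinations, so an exhaustive search of this kind is feasible as long as scoring a single combination is cheap.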

