Selection of relevant features and examples in machine learning

Artificial Intelligence, Vol. 97, No. 1-2. (December 1997), pp. 245-271.


sirandreus's tags for this article

feature-selection relevance

Notes for this article

A feature that is relevant is not guaranteed to be useful to a learning algorithm, and a useful feature is not guaranteed to be relevant.

Notation

  • S is the sample set
  • D is the probability distribution from which the elements of S are drawn
  • c is the target function, which maps each element of S to a class

Relevance to target 

Definition 1

A feature x_i is relevant to a target concept c if there exists a pair of examples A and B in the instance space such that A and B differ only in their assignment to x_i and c(A) != c(B).
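Definition 1 can be checked by brute force when the instance space is small. The following is a minimal sketch, assuming Boolean features so that "differ only in x_i" means flipping one bit; the names is_relevant and target, and the concept x_0 OR x_1, are made up for illustration and are not from the paper.

```python
from itertools import product

def is_relevant(c, i, n):
    """Return True if feature i is relevant to concept c under Definition 1.

    Assumption: the instance space is {0,1}^n, so two examples that
    differ only in x_i are obtained by flipping bit i.
    """
    for a in product((0, 1), repeat=n):
        if a[i] == 1:
            continue  # visit each pair once, starting from its x_i = 0 member
        b = a[:i] + (1,) + a[i + 1:]  # same example with x_i set to 1
        if c(a) != c(b):
            return True
    return False

def target(x):
    # Hypothetical concept: x_0 OR x_1; x_2 plays no role.
    return 1 if (x[0] or x[1]) else 0

relevant = [i for i in range(3) if is_relevant(target, i, 3)]
print(relevant)  # [0, 1]
```

The check enumerates all 2^n examples, so it is only practical for a handful of features.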


Relevance as a complexity measure (used in Winnow)

This notion of relevance measures the complexity of the target function. The practical goal is not to identify an irrelevant subset of features, but to have the algorithm perform well when the complexity is low.
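To make the Winnow reference concrete, here is a rough sketch of a Winnow-style learner (the variant that demotes weights multiplicatively). It is not taken from the note: the threshold of n, the factor alpha = 2, the function names, and the 6-feature concept are all assumptions chosen for illustration. The learner never selects a feature subset; it simply makes few mistakes when only a few features matter.

```python
from itertools import product

def predict(w, x, theta):
    # Linear threshold unit over Boolean inputs.
    return 1 if sum(wi for wi, xi in zip(w, x) if xi) >= theta else 0

def winnow(examples, n, alpha=2.0):
    """Online Winnow-style learner; returns (weights, theta, mistakes)."""
    w = [1.0] * n       # every weight starts at 1
    theta = float(n)    # threshold choice is an assumption of this sketch
    mistakes = 0
    for x, y in examples:
        if predict(w, x, theta) == y:
            continue    # weights change only on mistakes
        mistakes += 1
        # Promote active weights on a missed positive,
        # demote them on a false positive.
        factor = alpha if y == 1 else 1.0 / alpha
        w = [wi * factor if xi else wi for wi, xi in zip(w, x)]
    return w, theta, mistakes

n = 6

def target(x):
    # Hypothetical concept: 2 relevant features out of 6.
    return 1 if (x[0] or x[1]) else 0

data = [(x, target(x)) for x in product((0, 1), repeat=n)]
w, theta, mistakes = winnow(data * 40, n)  # 40 passes over the instance space
print(mistakes, w)
```

After training, the weights of the two relevant features are at least the threshold, so either one alone triggers a positive prediction, while the irrelevant features together stay below it.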

sirandreus (public note) - 2008-12-01 21:42:42

Abstract

In this survey, we review work in machine learning on methods for handling data sets containing large amounts of irrelevant information. We focus on two key issues: the problem of selecting relevant features, and the problem of selecting relevant examples. We describe the advances that have been made on these topics in both empirical and theoretical work in machine learning, and we present a general framework that we use to compare different methods. We close with some challenges for future work in this area.

