![]() |
CiteULike | ![]() |
sirandreus's CiteULike | ![]() |
![]() |
|
![]() |
Register | ![]() |
Log in | ![]() |
Selection of relevant features and examples in machine learningby: A. Blum
|
Reviews
[Write a review of this article]
Notes for this articleThere is no guarantee that just because a feature is relevant , it will be necessarily useful to an algorithm (and vice versa)
Notation
- S is the sample set
- D is the probability distribution of each element of S
- c is the target function which maps an element from S to a class
Relevance to target Definition 1
A feature x_i is relevant to a target concept c if there exists a pair of examples A and B in the instance space such that A and B differ only in ther assignment to x_i and c(A) != c(B).
Relevance as a complexity measure (used in Winnow)
Measures complexity of the target function. Practical goal is not to identify an irrelevant subset of features, but to perform well when the complexity is low.
Find related articles from these CiteULike users
Find related articles with these CiteULike tags
Posting History
AbstractIn this survey, we review work in machine learning on methods for handling data sets containing large amounts of irrelevant information. We focus on two key issues: the problem of selecting relevant features, and the problem of selecting relevant examples. We describe the advances that have been made on these topics in both empirical and theoretical work in machine learning, and we present a general framework that we use to compare different methods. We close with some challenges for future work in this area.
BibTeX record
RIS record