In statistics and machine learning, feature selection (also called attribute selection, variable subset selection or variable selection) is the process of proposing a subset of related features (variables, predictors) for the construction of a model. Feature selection techniques are widely applied for three reasons: simplification of models to make them easier to interpret by users/researchers, shorter training times, enhanced generalization by reducing over fitting. The central premise when adopting a feature selection technique is that the data contains lots of features that is either irrelevant or redundant, and can thus be removed without causing much loss of information. Irrelevant or redundant features are two different notions, since one relevant feature may be redundant in the presence of another relevant feature with which it is correlated to a large extent.
A feature selection algorithm can be considered as the combination of a search technique for selecting new feature subsets and an evaluation measure which scores the distinct feature subsets. The simplest algorithm is to test each subset of features finding the one that minimizes the error rate. This is an exhaustive search of the space, and is computationally intractable for all but the smallest of feature sets. The choice of evaluation metric heavily influences the algorithm, and it is these evaluation metrics that distinguish between the three main sections of feature selection algorithms:
- Wrapper methods use a predictive model to score feature subsets. Each new subset is applied to train a model, which is tested on a hold-out set.
- Filter methods use a proxy measure rather than the error rate to score a feature subset.
- Embedded methods are a catch-all group of techniques which carry out feature selection as part of the model construction process.
Applications of feature selection
- SNPs study
- Microarray
- Spectral Mass
- Disease research(e.g. Alzheimer's disease)
How to place an order:
*If your organization requires signing of a confidentiality agreement, please contact us by email
As one of the leading omics industry company in the world! Creative Proteomics now is opening to provide feature selection analysis service for our customers. With rich experience in the field of bioinformatics, we are willing to provide our customers the most outstanding service! Contact us for all the detailed information!