Cargando…

A Jackknife and Voting Classifier Approach to Feature Selection and Classification

With technological advances now allowing measurement of thousands of genes, proteins and metabolites, researchers are using this information to develop diagnostic and prognostic tests and discern the biological pathways underlying diseases. Often, an investigator’s objective is to develop a classifi...

Descripción completa

Detalles Bibliográficos
Autores principales: Taylor, Sandra L., Kim, Kyoungmi
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091410/
https://www.ncbi.nlm.nih.gov/pubmed/21584263
http://dx.doi.org/10.4137/CIN.S7111
_version_ 1782203253377728512
author Taylor, Sandra L.
Kim, Kyoungmi
author_facet Taylor, Sandra L.
Kim, Kyoungmi
author_sort Taylor, Sandra L.
collection PubMed
description With technological advances now allowing measurement of thousands of genes, proteins and metabolites, researchers are using this information to develop diagnostic and prognostic tests and discern the biological pathways underlying diseases. Often, an investigator’s objective is to develop a classification rule to predict group membership of unknown samples based on a small set of features and that could ultimately be used in a clinical setting. While common classification methods such as random forest and support vector machines are effective at separating groups, they do not directly translate into a clinically-applicable classification rule based on a small number of features.We present a simple feature selection and classification method for biomarker detection that is intuitively understandable and can be directly extended for application to a clinical setting. We first use a jackknife procedure to identify important features and then, for classification, we use voting classifiers which are simple and easy to implement. We compared our method to random forest and support vector machines using three benchmark cancer ‘omics datasets with different characteristics. We found our jackknife procedure and voting classifier to perform comparably to these two methods in terms of accuracy. Further, the jackknife procedure yielded stable feature sets. Voting classifiers in combination with a robust feature selection method such as our jackknife procedure offer an effective, simple and intuitive approach to feature selection and classification with a clear extension to clinical applications.
format Text
id pubmed-3091410
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-30914102011-05-16 A Jackknife and Voting Classifier Approach to Feature Selection and Classification Taylor, Sandra L. Kim, Kyoungmi Cancer Inform Methodology With technological advances now allowing measurement of thousands of genes, proteins and metabolites, researchers are using this information to develop diagnostic and prognostic tests and discern the biological pathways underlying diseases. Often, an investigator’s objective is to develop a classification rule to predict group membership of unknown samples based on a small set of features and that could ultimately be used in a clinical setting. While common classification methods such as random forest and support vector machines are effective at separating groups, they do not directly translate into a clinically-applicable classification rule based on a small number of features.We present a simple feature selection and classification method for biomarker detection that is intuitively understandable and can be directly extended for application to a clinical setting. We first use a jackknife procedure to identify important features and then, for classification, we use voting classifiers which are simple and easy to implement. We compared our method to random forest and support vector machines using three benchmark cancer ‘omics datasets with different characteristics. We found our jackknife procedure and voting classifier to perform comparably to these two methods in terms of accuracy. Further, the jackknife procedure yielded stable feature sets. Voting classifiers in combination with a robust feature selection method such as our jackknife procedure offer an effective, simple and intuitive approach to feature selection and classification with a clear extension to clinical applications. Libertas Academica 2011-04-27 /pmc/articles/PMC3091410/ /pubmed/21584263 http://dx.doi.org/10.4137/CIN.S7111 Text en © the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Methodology
Taylor, Sandra L.
Kim, Kyoungmi
A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title_full A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title_fullStr A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title_full_unstemmed A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title_short A Jackknife and Voting Classifier Approach to Feature Selection and Classification
title_sort jackknife and voting classifier approach to feature selection and classification
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091410/
https://www.ncbi.nlm.nih.gov/pubmed/21584263
http://dx.doi.org/10.4137/CIN.S7111
work_keys_str_mv AT taylorsandral ajackknifeandvotingclassifierapproachtofeatureselectionandclassification
AT kimkyoungmi ajackknifeandvotingclassifierapproachtofeatureselectionandclassification
AT taylorsandral jackknifeandvotingclassifierapproachtofeatureselectionandclassification
AT kimkyoungmi jackknifeandvotingclassifierapproachtofeatureselectionandclassification