Cargando…
Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784073/ https://www.ncbi.nlm.nih.gov/pubmed/24106694 http://dx.doi.org/10.1155/2013/148014 |
_version_ | 1782285762283175936 |
---|---|
author | Assawamakin, Anunchai Prueksaaroon, Supakit Kulawonganunchai, Supasak Shaw, Philip James Varavithya, Vara Ruangrajitpakorn, Taneth Tongsima, Sissades |
author_facet | Assawamakin, Anunchai Prueksaaroon, Supakit Kulawonganunchai, Supasak Shaw, Philip James Varavithya, Vara Ruangrajitpakorn, Taneth Tongsima, Sissades |
author_sort | Assawamakin, Anunchai |
collection | PubMed |
description | Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naïve Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naïve Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naïve Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naïve Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time. |
format | Online Article Text |
id | pubmed-3784073 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-37840732013-10-08 Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework Assawamakin, Anunchai Prueksaaroon, Supakit Kulawonganunchai, Supasak Shaw, Philip James Varavithya, Vara Ruangrajitpakorn, Taneth Tongsima, Sissades Biomed Res Int Research Article Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naïve Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naïve Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naïve Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naïve Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time. Hindawi Publishing Corporation 2013 2013-09-11 /pmc/articles/PMC3784073/ /pubmed/24106694 http://dx.doi.org/10.1155/2013/148014 Text en Copyright © 2013 Anunchai Assawamakin et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Assawamakin, Anunchai Prueksaaroon, Supakit Kulawonganunchai, Supasak Shaw, Philip James Varavithya, Vara Ruangrajitpakorn, Taneth Tongsima, Sissades Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title | Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title_full | Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title_fullStr | Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title_full_unstemmed | Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title_short | Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework |
title_sort | biomarker selection and classification of “-omics” data using a two-step bayes classification framework |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784073/ https://www.ncbi.nlm.nih.gov/pubmed/24106694 http://dx.doi.org/10.1155/2013/148014 |
work_keys_str_mv | AT assawamakinanunchai biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT prueksaaroonsupakit biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT kulawonganunchaisupasak biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT shawphilipjames biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT varavithyavara biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT ruangrajitpakorntaneth biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework AT tongsimasissades biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework |