Cargando…

Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework

Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First,...

Descripción completa

Detalles Bibliográficos
Autores principales: Assawamakin, Anunchai, Prueksaaroon, Supakit, Kulawonganunchai, Supasak, Shaw, Philip James, Varavithya, Vara, Ruangrajitpakorn, Taneth, Tongsima, Sissades
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784073/
https://www.ncbi.nlm.nih.gov/pubmed/24106694
http://dx.doi.org/10.1155/2013/148014
_version_ 1782285762283175936
author Assawamakin, Anunchai
Prueksaaroon, Supakit
Kulawonganunchai, Supasak
Shaw, Philip James
Varavithya, Vara
Ruangrajitpakorn, Taneth
Tongsima, Sissades
author_facet Assawamakin, Anunchai
Prueksaaroon, Supakit
Kulawonganunchai, Supasak
Shaw, Philip James
Varavithya, Vara
Ruangrajitpakorn, Taneth
Tongsima, Sissades
author_sort Assawamakin, Anunchai
collection PubMed
description Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naïve Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naïve Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naïve Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naïve Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time.
format Online
Article
Text
id pubmed-3784073
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-37840732013-10-08 Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework Assawamakin, Anunchai Prueksaaroon, Supakit Kulawonganunchai, Supasak Shaw, Philip James Varavithya, Vara Ruangrajitpakorn, Taneth Tongsima, Sissades Biomed Res Int Research Article Identification of suitable biomarkers for accurate prediction of phenotypic outcomes is a goal for personalized medicine. However, current machine learning approaches are either too complex or perform poorly. Here, a novel two-step machine-learning framework is presented to address this need. First, a Naïve Bayes estimator is used to rank features from which the top-ranked will most likely contain the most informative features for prediction of the underlying biological classes. The top-ranked features are then used in a Hidden Naïve Bayes classifier to construct a classification prediction model from these filtered attributes. In order to obtain the minimum set of the most informative biomarkers, the bottom-ranked features are successively removed from the Naïve Bayes-filtered feature list one at a time, and the classification accuracy of the Hidden Naïve Bayes classifier is checked for each pruned feature set. The performance of the proposed two-step Bayes classification framework was tested on different types of -omics datasets including gene expression microarray, single nucleotide polymorphism microarray (SNParray), and surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) proteomic data. The proposed two-step Bayes classification framework was equal to and, in some cases, outperformed other classification methods in terms of prediction accuracy, minimum number of classification markers, and computational time. Hindawi Publishing Corporation 2013 2013-09-11 /pmc/articles/PMC3784073/ /pubmed/24106694 http://dx.doi.org/10.1155/2013/148014 Text en Copyright © 2013 Anunchai Assawamakin et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Assawamakin, Anunchai
Prueksaaroon, Supakit
Kulawonganunchai, Supasak
Shaw, Philip James
Varavithya, Vara
Ruangrajitpakorn, Taneth
Tongsima, Sissades
Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title_full Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title_fullStr Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title_full_unstemmed Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title_short Biomarker Selection and Classification of “-Omics” Data Using a Two-Step Bayes Classification Framework
title_sort biomarker selection and classification of “-omics” data using a two-step bayes classification framework
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3784073/
https://www.ncbi.nlm.nih.gov/pubmed/24106694
http://dx.doi.org/10.1155/2013/148014
work_keys_str_mv AT assawamakinanunchai biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT prueksaaroonsupakit biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT kulawonganunchaisupasak biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT shawphilipjames biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT varavithyavara biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT ruangrajitpakorntaneth biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework
AT tongsimasissades biomarkerselectionandclassificationofomicsdatausingatwostepbayesclassificationframework