Cargando…

Network-based support vector machine for classification of microarray samples

BACKGROUND: The importance of network-based approach to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. To our knowledge, there have been few, if any, statistical tools that explicitly incorpor...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yanni, Shen, Xiaotong, Pan, Wei
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648796/
https://www.ncbi.nlm.nih.gov/pubmed/19208121
http://dx.doi.org/10.1186/1471-2105-10-S1-S21
_version_ 1782164990362386432
author Zhu, Yanni
Shen, Xiaotong
Pan, Wei
author_facet Zhu, Yanni
Shen, Xiaotong
Pan, Wei
author_sort Zhu, Yanni
collection PubMed
description BACKGROUND: The importance of network-based approach to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. To our knowledge, there have been few, if any, statistical tools that explicitly incorporate the prior information of gene networks into classifier building. The main idea of this paper is to take full advantage of the biological observation that neighboring genes in a network tend to function together in biological processes and to embed this information into a formal statistical framework. RESULTS: We propose a network-based support vector machine for binary classification problems by constructing a penalty term from the F(∞)-norm being applied to pairwise gene neighbors with the hope to improve predictive performance and gene selection. Simulation studies in both low- and high-dimensional data settings as well as two real microarray applications indicate that the proposed method is able to identify more clinically relevant genes while maintaining a sparse model with either similar or higher prediction accuracy compared with the standard and the L(1 )penalized support vector machines. CONCLUSION: The proposed network-based support vector machine has the potential to be a practically useful classification tool for microarrays and other high-dimensional data.
format Text
id pubmed-2648796
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26487962009-03-03 Network-based support vector machine for classification of microarray samples Zhu, Yanni Shen, Xiaotong Pan, Wei BMC Bioinformatics Research BACKGROUND: The importance of network-based approach to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. To our knowledge, there have been few, if any, statistical tools that explicitly incorporate the prior information of gene networks into classifier building. The main idea of this paper is to take full advantage of the biological observation that neighboring genes in a network tend to function together in biological processes and to embed this information into a formal statistical framework. RESULTS: We propose a network-based support vector machine for binary classification problems by constructing a penalty term from the F(∞)-norm being applied to pairwise gene neighbors with the hope to improve predictive performance and gene selection. Simulation studies in both low- and high-dimensional data settings as well as two real microarray applications indicate that the proposed method is able to identify more clinically relevant genes while maintaining a sparse model with either similar or higher prediction accuracy compared with the standard and the L(1 )penalized support vector machines. CONCLUSION: The proposed network-based support vector machine has the potential to be a practically useful classification tool for microarrays and other high-dimensional data. BioMed Central 2009-01-30 /pmc/articles/PMC2648796/ /pubmed/19208121 http://dx.doi.org/10.1186/1471-2105-10-S1-S21 Text en Copyright © 2009 Zhu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Zhu, Yanni
Shen, Xiaotong
Pan, Wei
Network-based support vector machine for classification of microarray samples
title Network-based support vector machine for classification of microarray samples
title_full Network-based support vector machine for classification of microarray samples
title_fullStr Network-based support vector machine for classification of microarray samples
title_full_unstemmed Network-based support vector machine for classification of microarray samples
title_short Network-based support vector machine for classification of microarray samples
title_sort network-based support vector machine for classification of microarray samples
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2648796/
https://www.ncbi.nlm.nih.gov/pubmed/19208121
http://dx.doi.org/10.1186/1471-2105-10-S1-S21
work_keys_str_mv AT zhuyanni networkbasedsupportvectormachineforclassificationofmicroarraysamples
AT shenxiaotong networkbasedsupportvectormachineforclassificationofmicroarraysamples
AT panwei networkbasedsupportvectormachineforclassificationofmicroarraysamples