Cargando…

Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data

BACKGROUND: Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approac...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Kai-Lin, Li, Tong-Hua, Xiong, Wen-Wei, Chen, Kai
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2846906/
https://www.ncbi.nlm.nih.gov/pubmed/20187963
http://dx.doi.org/10.1186/1471-2105-11-109
_version_ 1782179516311928832
author Tang, Kai-Lin
Li, Tong-Hua
Xiong, Wen-Wei
Chen, Kai
author_facet Tang, Kai-Lin
Li, Tong-Hua
Xiong, Wen-Wei
Chen, Kai
author_sort Tang, Kai-Lin
collection PubMed
description BACKGROUND: Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approach for dimensionality reduction and tested it using published high-resolution SELDI-TOF data for ovarian cancer. RESULTS: We propose a method based on statistical moments to reduce feature dimensions. After refining and t-testing, SELDI-TOF data are divided into several intervals. Four statistical moments (mean, variance, skewness and kurtosis) are calculated for each interval and are used as representative variables. The high dimensionality of the data can thus be rapidly reduced. To improve efficiency and classification performance, the data are further used in kernel PLS models. The method achieved average sensitivity of 0.9950, specificity of 0.9916, accuracy of 0.9935 and a correlation coefficient of 0.9869 for 100 five-fold cross validations. Furthermore, only one control was misclassified in leave-one-out cross validation. CONCLUSION: The proposed method is suitable for analyzing high-throughput proteomics data.
format Text
id pubmed-2846906
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28469062010-03-30 Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data Tang, Kai-Lin Li, Tong-Hua Xiong, Wen-Wei Chen, Kai BMC Bioinformatics Research article BACKGROUND: Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approach for dimensionality reduction and tested it using published high-resolution SELDI-TOF data for ovarian cancer. RESULTS: We propose a method based on statistical moments to reduce feature dimensions. After refining and t-testing, SELDI-TOF data are divided into several intervals. Four statistical moments (mean, variance, skewness and kurtosis) are calculated for each interval and are used as representative variables. The high dimensionality of the data can thus be rapidly reduced. To improve efficiency and classification performance, the data are further used in kernel PLS models. The method achieved average sensitivity of 0.9950, specificity of 0.9916, accuracy of 0.9935 and a correlation coefficient of 0.9869 for 100 five-fold cross validations. Furthermore, only one control was misclassified in leave-one-out cross validation. CONCLUSION: The proposed method is suitable for analyzing high-throughput proteomics data. BioMed Central 2010-02-27 /pmc/articles/PMC2846906/ /pubmed/20187963 http://dx.doi.org/10.1186/1471-2105-11-109 Text en Copyright ©2010 Tang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Tang, Kai-Lin
Li, Tong-Hua
Xiong, Wen-Wei
Chen, Kai
Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_full Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_fullStr Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_full_unstemmed Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_short Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data
title_sort ovarian cancer classification based on dimensionality reduction for seldi-tof data
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2846906/
https://www.ncbi.nlm.nih.gov/pubmed/20187963
http://dx.doi.org/10.1186/1471-2105-11-109
work_keys_str_mv AT tangkailin ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT litonghua ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT xiongwenwei ovariancancerclassificationbasedondimensionalityreductionforselditofdata
AT chenkai ovariancancerclassificationbasedondimensionalityreductionforselditofdata