Cargando…

Identification of biomarkers from mass spectrometry data using a "common" peak approach

BACKGROUND: Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem. RESULTS: This paper proposes the use of "common" peaks i...

Descripción completa

Detalles Bibliográficos
Autores principales: Fushiki, Tadayoshi, Fujisawa, Hironori, Eguchi, Shinto
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1550264/
https://www.ncbi.nlm.nih.gov/pubmed/16869977
http://dx.doi.org/10.1186/1471-2105-7-358
_version_ 1782129215558123520
author Fushiki, Tadayoshi
Fujisawa, Hironori
Eguchi, Shinto
author_facet Fushiki, Tadayoshi
Fujisawa, Hironori
Eguchi, Shinto
author_sort Fushiki, Tadayoshi
collection PubMed
description BACKGROUND: Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem. RESULTS: This paper proposes the use of "common" peaks in data as biomarkers. Analysis is conducted as follows: data preprocessing, identification of biomarkers, and application of AdaBoost to construct a classification function. Informative "common" peaks are selected by AdaBoost. AsymBoost is also examined to balance false negatives and false positives. The effectiveness of the approach is demonstrated using an ovarian cancer dataset. CONCLUSION: Continuous covariates and discrete covariates can be used in the present approach. The difference between the result for the continuous covariates and that for the discrete covariates was investigated in detail. In the example considered here, both covariates provide a good prediction, but it seems that they provide different kinds of information. We can obtain more information on the structure of the data by integrating both results.
format Text
id pubmed-1550264
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15502642006-08-19 Identification of biomarkers from mass spectrometry data using a "common" peak approach Fushiki, Tadayoshi Fujisawa, Hironori Eguchi, Shinto BMC Bioinformatics Methodology Article BACKGROUND: Proteomic data obtained from mass spectrometry have attracted great interest for the detection of early-stage cancer. However, as mass spectrometry data are high-dimensional, identification of biomarkers is a key problem. RESULTS: This paper proposes the use of "common" peaks in data as biomarkers. Analysis is conducted as follows: data preprocessing, identification of biomarkers, and application of AdaBoost to construct a classification function. Informative "common" peaks are selected by AdaBoost. AsymBoost is also examined to balance false negatives and false positives. The effectiveness of the approach is demonstrated using an ovarian cancer dataset. CONCLUSION: Continuous covariates and discrete covariates can be used in the present approach. The difference between the result for the continuous covariates and that for the discrete covariates was investigated in detail. In the example considered here, both covariates provide a good prediction, but it seems that they provide different kinds of information. We can obtain more information on the structure of the data by integrating both results. BioMed Central 2006-07-26 /pmc/articles/PMC1550264/ /pubmed/16869977 http://dx.doi.org/10.1186/1471-2105-7-358 Text en Copyright © 2006 Fushiki et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Fushiki, Tadayoshi
Fujisawa, Hironori
Eguchi, Shinto
Identification of biomarkers from mass spectrometry data using a "common" peak approach
title Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_full Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_fullStr Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_full_unstemmed Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_short Identification of biomarkers from mass spectrometry data using a "common" peak approach
title_sort identification of biomarkers from mass spectrometry data using a "common" peak approach
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1550264/
https://www.ncbi.nlm.nih.gov/pubmed/16869977
http://dx.doi.org/10.1186/1471-2105-7-358
work_keys_str_mv AT fushikitadayoshi identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach
AT fujisawahironori identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach
AT eguchishinto identificationofbiomarkersfrommassspectrometrydatausingacommonpeakapproach