Cargando…
Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2647828/ https://www.ncbi.nlm.nih.gov/pubmed/19244390 http://dx.doi.org/10.1093/bioinformatics/btn662 |
_version_ | 1782164940086312960 |
---|---|
author | Alexandrov, Theodore Decker, Jens Mertens, Bart Deelder, Andre M. Tollenaar, Rob A. E. M. Maass, Peter Thiele, Herbert |
author_facet | Alexandrov, Theodore Decker, Jens Mertens, Bart Deelder, Andre M. Tollenaar, Rob A. E. M. Maass, Peter Thiele, Herbert |
author_sort | Alexandrov, Theodore |
collection | PubMed |
description | Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection of discriminative wavelet coefficients by a statistical test and (iii) building and evaluating a support vector machine classifier by double cross-validation with attention to the generalizability of the results. In addition to the evaluation results (total recognition rate, sensitivity and specificity), the procedure provides the biomarker patterns, i.e. the parts of spectra which discriminate cancer and control individuals. The evaluation was performed on matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) serum protein profiles of 66 colorectal cancer patients and 50 controls. Results: Our procedure provided a high recognition rate (97.3%), sensitivity (98.4%) and specificity (95.8%). The extracted biomarker patterns mostly represent the peaks expressing mean differences between the cancer and control spectra. However, we showed that the discriminative power of a peak is not simply expressed by its mean height and cannot be derived by comparison of the mean spectra. The obtained classifiers have high generalization power as measured by the number of support vectors. This prevents overfitting and contributes to the reproducibility of the results, which is required to find biomarkers differentiating cancer patients from healthy individuals. Availability: The data and scripts used in this study are available at http://www.math.uni-bremen.de/~theodore/MALDIDWT. Contact: theodore@math.uni-bremen.de Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Text |
id | pubmed-2647828 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26478282009-04-02 Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation Alexandrov, Theodore Decker, Jens Mertens, Bart Deelder, Andre M. Tollenaar, Rob A. E. M. Maass, Peter Thiele, Herbert Bioinformatics Original Papers Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection of discriminative wavelet coefficients by a statistical test and (iii) building and evaluating a support vector machine classifier by double cross-validation with attention to the generalizability of the results. In addition to the evaluation results (total recognition rate, sensitivity and specificity), the procedure provides the biomarker patterns, i.e. the parts of spectra which discriminate cancer and control individuals. The evaluation was performed on matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) serum protein profiles of 66 colorectal cancer patients and 50 controls. Results: Our procedure provided a high recognition rate (97.3%), sensitivity (98.4%) and specificity (95.8%). The extracted biomarker patterns mostly represent the peaks expressing mean differences between the cancer and control spectra. However, we showed that the discriminative power of a peak is not simply expressed by its mean height and cannot be derived by comparison of the mean spectra. The obtained classifiers have high generalization power as measured by the number of support vectors. This prevents overfitting and contributes to the reproducibility of the results, which is required to find biomarkers differentiating cancer patients from healthy individuals. Availability: The data and scripts used in this study are available at http://www.math.uni-bremen.de/~theodore/MALDIDWT. Contact: theodore@math.uni-bremen.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-03-01 2009-01-06 /pmc/articles/PMC2647828/ /pubmed/19244390 http://dx.doi.org/10.1093/bioinformatics/btn662 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Alexandrov, Theodore Decker, Jens Mertens, Bart Deelder, Andre M. Tollenaar, Rob A. E. M. Maass, Peter Thiele, Herbert Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title | Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title_full | Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title_fullStr | Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title_full_unstemmed | Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title_short | Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation |
title_sort | biomarker discovery in maldi-tof serum protein profiles using discrete wavelet transformation |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2647828/ https://www.ncbi.nlm.nih.gov/pubmed/19244390 http://dx.doi.org/10.1093/bioinformatics/btn662 |
work_keys_str_mv | AT alexandrovtheodore biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT deckerjens biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT mertensbart biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT deelderandrem biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT tollenaarrobaem biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT maasspeter biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation AT thieleherbert biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation |