Cargando…

Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation

Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection...

Descripción completa

Detalles Bibliográficos
Autores principales: Alexandrov, Theodore, Decker, Jens, Mertens, Bart, Deelder, Andre M., Tollenaar, Rob A. E. M., Maass, Peter, Thiele, Herbert
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2647828/
https://www.ncbi.nlm.nih.gov/pubmed/19244390
http://dx.doi.org/10.1093/bioinformatics/btn662
_version_ 1782164940086312960
author Alexandrov, Theodore
Decker, Jens
Mertens, Bart
Deelder, Andre M.
Tollenaar, Rob A. E. M.
Maass, Peter
Thiele, Herbert
author_facet Alexandrov, Theodore
Decker, Jens
Mertens, Bart
Deelder, Andre M.
Tollenaar, Rob A. E. M.
Maass, Peter
Thiele, Herbert
author_sort Alexandrov, Theodore
collection PubMed
description Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection of discriminative wavelet coefficients by a statistical test and (iii) building and evaluating a support vector machine classifier by double cross-validation with attention to the generalizability of the results. In addition to the evaluation results (total recognition rate, sensitivity and specificity), the procedure provides the biomarker patterns, i.e. the parts of spectra which discriminate cancer and control individuals. The evaluation was performed on matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) serum protein profiles of 66 colorectal cancer patients and 50 controls. Results: Our procedure provided a high recognition rate (97.3%), sensitivity (98.4%) and specificity (95.8%). The extracted biomarker patterns mostly represent the peaks expressing mean differences between the cancer and control spectra. However, we showed that the discriminative power of a peak is not simply expressed by its mean height and cannot be derived by comparison of the mean spectra. The obtained classifiers have high generalization power as measured by the number of support vectors. This prevents overfitting and contributes to the reproducibility of the results, which is required to find biomarkers differentiating cancer patients from healthy individuals. Availability: The data and scripts used in this study are available at http://www.math.uni-bremen.de/~theodore/MALDIDWT. Contact: theodore@math.uni-bremen.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-2647828
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26478282009-04-02 Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation Alexandrov, Theodore Decker, Jens Mertens, Bart Deelder, Andre M. Tollenaar, Rob A. E. M. Maass, Peter Thiele, Herbert Bioinformatics Original Papers Motivation: Automatic classification of high-resolution mass spectrometry proteomic data has increasing potential in the early diagnosis of cancer. We propose a new procedure of biomarker discovery in serum protein profiles based on: (i) discrete wavelet transformation of the spectra; (ii) selection of discriminative wavelet coefficients by a statistical test and (iii) building and evaluating a support vector machine classifier by double cross-validation with attention to the generalizability of the results. In addition to the evaluation results (total recognition rate, sensitivity and specificity), the procedure provides the biomarker patterns, i.e. the parts of spectra which discriminate cancer and control individuals. The evaluation was performed on matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) serum protein profiles of 66 colorectal cancer patients and 50 controls. Results: Our procedure provided a high recognition rate (97.3%), sensitivity (98.4%) and specificity (95.8%). The extracted biomarker patterns mostly represent the peaks expressing mean differences between the cancer and control spectra. However, we showed that the discriminative power of a peak is not simply expressed by its mean height and cannot be derived by comparison of the mean spectra. The obtained classifiers have high generalization power as measured by the number of support vectors. This prevents overfitting and contributes to the reproducibility of the results, which is required to find biomarkers differentiating cancer patients from healthy individuals. Availability: The data and scripts used in this study are available at http://www.math.uni-bremen.de/~theodore/MALDIDWT. Contact: theodore@math.uni-bremen.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2009-03-01 2009-01-06 /pmc/articles/PMC2647828/ /pubmed/19244390 http://dx.doi.org/10.1093/bioinformatics/btn662 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Alexandrov, Theodore
Decker, Jens
Mertens, Bart
Deelder, Andre M.
Tollenaar, Rob A. E. M.
Maass, Peter
Thiele, Herbert
Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title_full Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title_fullStr Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title_full_unstemmed Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title_short Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation
title_sort biomarker discovery in maldi-tof serum protein profiles using discrete wavelet transformation
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2647828/
https://www.ncbi.nlm.nih.gov/pubmed/19244390
http://dx.doi.org/10.1093/bioinformatics/btn662
work_keys_str_mv AT alexandrovtheodore biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT deckerjens biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT mertensbart biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT deelderandrem biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT tollenaarrobaem biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT maasspeter biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation
AT thieleherbert biomarkerdiscoveryinmalditofserumproteinprofilesusingdiscretewavelettransformation