Cargando…
A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies
INTRODUCTION: Raw spectral data from matrix-assisted laser desorption/ionisation time-of-flight (MALDI-TOF) with MS profiling techniques usually contains complex information not readily providing biological insight into disease. The association of identified features within raw data to a known pepti...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224566/ https://www.ncbi.nlm.nih.gov/pubmed/21929822 http://dx.doi.org/10.1186/1559-0275-8-14 |
_version_ | 1782217410743369728 |
---|---|
author | Tong, Dong L Boocock, David J Coveney, Clare Saif, Jaimy Gomez, Susana G Querol, Sergio Rees, Robert Ball, Graham R |
author_facet | Tong, Dong L Boocock, David J Coveney, Clare Saif, Jaimy Gomez, Susana G Querol, Sergio Rees, Robert Ball, Graham R |
author_sort | Tong, Dong L |
collection | PubMed |
description | INTRODUCTION: Raw spectral data from matrix-assisted laser desorption/ionisation time-of-flight (MALDI-TOF) with MS profiling techniques usually contains complex information not readily providing biological insight into disease. The association of identified features within raw data to a known peptide is extremely difficult. Data preprocessing to remove uncertainty characteristics in the data is normally required before performing any further analysis. This study proposes an alternative yet simple solution to preprocess raw MALDI-TOF-MS data for identification of candidate marker ions. Two in-house MALDI-TOF-MS data sets from two different sample sources (melanoma serum and cord blood plasma) are used in our study. METHOD: Raw MS spectral profiles were preprocessed using the proposed approach to identify peak regions in the spectra. The preprocessed data was then analysed using bespoke machine learning algorithms for data reduction and ion selection. Using the selected ions, an ANN-based predictive model was constructed to examine the predictive power of these ions for classification. RESULTS: Our model identified 10 candidate marker ions for both data sets. These ion panels achieved over 90% classification accuracy on blind validation data. Receiver operating characteristics analysis was performed and the area under the curve for melanoma and cord blood classifiers was 0.991 and 0.986, respectively. CONCLUSION: The results suggest that our data preprocessing technique removes unwanted characteristics of the raw data, while preserving the predictive components of the data. Ion identification analysis can be carried out using MALDI-TOF-MS data with the proposed data preprocessing technique coupled with bespoke algorithms for data reduction and ion selection. |
format | Online Article Text |
id | pubmed-3224566 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Springer |
record_format | MEDLINE/PubMed |
spelling | pubmed-32245662011-12-22 A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies Tong, Dong L Boocock, David J Coveney, Clare Saif, Jaimy Gomez, Susana G Querol, Sergio Rees, Robert Ball, Graham R Clin Proteomics Research INTRODUCTION: Raw spectral data from matrix-assisted laser desorption/ionisation time-of-flight (MALDI-TOF) with MS profiling techniques usually contains complex information not readily providing biological insight into disease. The association of identified features within raw data to a known peptide is extremely difficult. Data preprocessing to remove uncertainty characteristics in the data is normally required before performing any further analysis. This study proposes an alternative yet simple solution to preprocess raw MALDI-TOF-MS data for identification of candidate marker ions. Two in-house MALDI-TOF-MS data sets from two different sample sources (melanoma serum and cord blood plasma) are used in our study. METHOD: Raw MS spectral profiles were preprocessed using the proposed approach to identify peak regions in the spectra. The preprocessed data was then analysed using bespoke machine learning algorithms for data reduction and ion selection. Using the selected ions, an ANN-based predictive model was constructed to examine the predictive power of these ions for classification. RESULTS: Our model identified 10 candidate marker ions for both data sets. These ion panels achieved over 90% classification accuracy on blind validation data. Receiver operating characteristics analysis was performed and the area under the curve for melanoma and cord blood classifiers was 0.991 and 0.986, respectively. CONCLUSION: The results suggest that our data preprocessing technique removes unwanted characteristics of the raw data, while preserving the predictive components of the data. Ion identification analysis can be carried out using MALDI-TOF-MS data with the proposed data preprocessing technique coupled with bespoke algorithms for data reduction and ion selection. Springer 2011-09-19 /pmc/articles/PMC3224566/ /pubmed/21929822 http://dx.doi.org/10.1186/1559-0275-8-14 Text en Copyright ©2011 Tong et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Tong, Dong L Boocock, David J Coveney, Clare Saif, Jaimy Gomez, Susana G Querol, Sergio Rees, Robert Ball, Graham R A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title | A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title_full | A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title_fullStr | A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title_full_unstemmed | A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title_short | A simpler method of preprocessing MALDI-TOF MS data for differential biomarker analysis: stem cell and melanoma cancer studies |
title_sort | simpler method of preprocessing maldi-tof ms data for differential biomarker analysis: stem cell and melanoma cancer studies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3224566/ https://www.ncbi.nlm.nih.gov/pubmed/21929822 http://dx.doi.org/10.1186/1559-0275-8-14 |
work_keys_str_mv | AT tongdongl asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT boocockdavidj asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT coveneyclare asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT saifjaimy asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT gomezsusanag asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT querolsergio asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT reesrobert asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT ballgrahamr asimplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT tongdongl simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT boocockdavidj simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT coveneyclare simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT saifjaimy simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT gomezsusanag simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT querolsergio simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT reesrobert simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies AT ballgrahamr simplermethodofpreprocessingmalditofmsdatafordifferentialbiomarkeranalysisstemcellandmelanomacancerstudies |