A new peak detection algorithm for MALDI mass spectrometry data based on a modified Asymmetric Pseudo-Voigt model

BACKGROUND: Mass Spectrometry (MS) is a ubiquitous analytical tool in biological research and is used to measure the mass-to-charge ratio of bio-molecules. Peak detection is the essential first step in MS data analysis. Precise estimation of peak parameters such as peak summit location and peak area...

Bibliographic Details
Main Authors: Wijetunge, Chalini D, Saeed, Isaam, Boughton, Berin A, Roessner, Ute, Halgamuge, Saman K
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682410/
https://www.ncbi.nlm.nih.gov/pubmed/26680279
http://dx.doi.org/10.1186/1471-2164-16-S12-S12
author Wijetunge, Chalini D
Saeed, Isaam
Boughton, Berin A
Roessner, Ute
Halgamuge, Saman K
collection PubMed
description BACKGROUND: Mass Spectrometry (MS) is a ubiquitous analytical tool in biological research and is used to measure the mass-to-charge ratio of bio-molecules. Peak detection is the essential first step in MS data analysis. Precise estimation of peak parameters, such as peak summit location and peak area, is critical to identify underlying bio-molecules and to estimate their abundances accurately. We propose a new method to detect and quantify peaks in mass spectra. It uses dual-tree complex wavelet transformation along with Stein's unbiased risk estimator for spectra smoothing. Then, a new method, based on the modified Asymmetric Pseudo-Voigt (mAPV) model and hierarchical particle swarm optimization, is used for peak parameter estimation. RESULTS: Using simulated data, we demonstrated the benefit of using the mAPV model over Gaussian, Lorentz and Bi-Gaussian functions for MS peak modelling. The proposed mAPV model achieved the best fitting accuracy for asymmetric peaks, with lower percentage errors in peak summit location estimation, which were 0.17% to 4.46% lower than those of the other models. It also outperformed the other models in peak area estimation, delivering lower percentage errors, which were about 0.7% lower than those of its closest competitor, the Bi-Gaussian model. In addition, using data generated from a MALDI-TOF computer model, we showed that the proposed overall algorithm outperformed the existing methods mainly in terms of sensitivity. It achieved a sensitivity of 85%, compared to 77% and 71% for the two benchmark algorithms, the continuous wavelet transformation based method and Cromwell, respectively. CONCLUSIONS: The proposed algorithm is particularly useful for peak detection and parameter estimation in MS data with overlapping peak distributions and asymmetric peaks. The algorithm is implemented using MATLAB and the source code is freely available at http://mapv.sourceforge.net.
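As a rough illustration of the kind of peak shape the abstract refers to, the following Python sketch evaluates a generic asymmetric pseudo-Voigt function: a weighted mix of a Lorentzian and a Gaussian whose width varies sigmoidally across the peak. This is not the authors' mAPV model or their MATLAB code; the record does not give the mAPV formula, so the functional form, the function names and every parameter (`center`, `height`, `fwhm0`, `eta`, `asym`) are assumptions chosen for illustration only.

```python
import math


def pseudo_voigt(x, center, height, fwhm, eta):
    """Symmetric pseudo-Voigt: eta * Lorentzian + (1 - eta) * Gaussian.

    Both components share the same full width at half maximum (fwhm).
    """
    d = x - center
    gauss = math.exp(-4.0 * math.log(2.0) * (d / fwhm) ** 2)
    lorentz = 1.0 / (1.0 + 4.0 * (d / fwhm) ** 2)
    return height * (eta * lorentz + (1.0 - eta) * gauss)


def asymmetric_pseudo_voigt(x, center, height, fwhm0, eta, asym):
    """Pseudo-Voigt whose width changes sigmoidally with x (illustrative form).

    asym = 0 recovers the symmetric peak; asym < 0 widens the right-hand
    side of the peak, asym > 0 widens the left-hand side.
    """
    fwhm = 2.0 * fwhm0 / (1.0 + math.exp(asym * (x - center)))
    return pseudo_voigt(x, center, height, fwhm, eta)


def trapezoid_area(xs, ys):
    """Peak area by the trapezoidal rule over sampled points."""
    return sum(
        0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
        for i in range(len(xs) - 1)
    )


if __name__ == "__main__":
    # Hypothetical peak near m/z 1000 with a tail toward higher m/z.
    xs = [990.0 + 0.05 * i for i in range(401)]
    ys = [asymmetric_pseudo_voigt(x, 1000.0, 1.0, 2.0, 0.5, -0.3) for x in xs]
    print("summit intensity:", max(ys))
    print("approximate area:", trapezoid_area(xs, ys))
```

In a fitting setting, parameters like these would be the quantities an optimizer (the abstract names hierarchical particle swarm optimization) adjusts to match a smoothed spectrum; the sketch only evaluates the shape and does not perform any fitting.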
format Online
Article
Text
id pubmed-4682410
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46824102015-12-21 A new peak detection algorithm for MALDI mass spectrometry data based on a modified Asymmetric Pseudo-Voigt model Wijetunge, Chalini D Saeed, Isaam Boughton, Berin A Roessner, Ute Halgamuge, Saman K BMC Genomics Research BACKGROUND: Mass Spectrometry (MS) is a ubiquitous analytical tool in biological research and is used to measure the mass-to-charge ratio of bio-molecules. Peak detection is the essential first step in MS data analysis. Precise estimation of peak parameters, such as peak summit location and peak area, is critical to identify underlying bio-molecules and to estimate their abundances accurately. We propose a new method to detect and quantify peaks in mass spectra. It uses dual-tree complex wavelet transformation along with Stein's unbiased risk estimator for spectra smoothing. Then, a new method, based on the modified Asymmetric Pseudo-Voigt (mAPV) model and hierarchical particle swarm optimization, is used for peak parameter estimation. RESULTS: Using simulated data, we demonstrated the benefit of using the mAPV model over Gaussian, Lorentz and Bi-Gaussian functions for MS peak modelling. The proposed mAPV model achieved the best fitting accuracy for asymmetric peaks, with lower percentage errors in peak summit location estimation, which were 0.17% to 4.46% lower than those of the other models. It also outperformed the other models in peak area estimation, delivering lower percentage errors, which were about 0.7% lower than those of its closest competitor, the Bi-Gaussian model. In addition, using data generated from a MALDI-TOF computer model, we showed that the proposed overall algorithm outperformed the existing methods mainly in terms of sensitivity. It achieved a sensitivity of 85%, compared to 77% and 71% for the two benchmark algorithms, the continuous wavelet transformation based method and Cromwell, respectively.
CONCLUSIONS: The proposed algorithm is particularly useful for peak detection and parameter estimation in MS data with overlapping peak distributions and asymmetric peaks. The algorithm is implemented using MATLAB and the source code is freely available at http://mapv.sourceforge.net. BioMed Central 2015-12-09 /pmc/articles/PMC4682410/ /pubmed/26680279 http://dx.doi.org/10.1186/1471-2164-16-S12-S12 Text en Copyright © 2015 Wijetunge et al. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title A new peak detection algorithm for MALDI mass spectrometry data based on a modified Asymmetric Pseudo-Voigt model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682410/
https://www.ncbi.nlm.nih.gov/pubmed/26680279
http://dx.doi.org/10.1186/1471-2164-16-S12-S12