Cargando…

Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry

Mixture - modeling of mass spectra is an approach with many potential applications including peak detection and quantification, smoothing, de-noising, feature extraction and spectral signal compression. However, existing algorithms do not allow for automated analyses of whole spectra. Therefore, des...

Descripción completa

Detalles Bibliográficos
Autores principales: Polanski, Andrzej, Marczyk, Michal, Pietrowska, Monika, Widlak, Piotr, Polanska, Joanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4521892/
https://www.ncbi.nlm.nih.gov/pubmed/26230717
http://dx.doi.org/10.1371/journal.pone.0134256
_version_ 1782383879282229248
author Polanski, Andrzej
Marczyk, Michal
Pietrowska, Monika
Widlak, Piotr
Polanska, Joanna
author_facet Polanski, Andrzej
Marczyk, Michal
Pietrowska, Monika
Widlak, Piotr
Polanska, Joanna
author_sort Polanski, Andrzej
collection PubMed
description Mixture - modeling of mass spectra is an approach with many potential applications including peak detection and quantification, smoothing, de-noising, feature extraction and spectral signal compression. However, existing algorithms do not allow for automated analyses of whole spectra. Therefore, despite highlighting potential advantages of mixture modeling of mass spectra of peptide/protein mixtures and some preliminary results presented in several papers, the mixture modeling approach was so far not developed to the stage enabling systematic comparisons with existing software packages for proteomic mass spectra analyses. In this paper we present an efficient algorithm for Gaussian mixture modeling of proteomic mass spectra of different types (e.g., MALDI-ToF profiling, MALDI-IMS). The main idea is automated partitioning of protein mass spectral signal into fragments. The obtained fragments are separately decomposed into Gaussian mixture models. The parameters of the mixture models of fragments are then aggregated to form the mixture model of the whole spectrum. We compare the elaborated algorithm to existing algorithms for peak detection and we demonstrate improvements of peak detection efficiency obtained by using Gaussian mixture modeling. We also show applications of the elaborated algorithm to real proteomic datasets of low and high resolution.
format Online
Article
Text
id pubmed-4521892
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45218922015-08-06 Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry Polanski, Andrzej Marczyk, Michal Pietrowska, Monika Widlak, Piotr Polanska, Joanna PLoS One Research Article Mixture - modeling of mass spectra is an approach with many potential applications including peak detection and quantification, smoothing, de-noising, feature extraction and spectral signal compression. However, existing algorithms do not allow for automated analyses of whole spectra. Therefore, despite highlighting potential advantages of mixture modeling of mass spectra of peptide/protein mixtures and some preliminary results presented in several papers, the mixture modeling approach was so far not developed to the stage enabling systematic comparisons with existing software packages for proteomic mass spectra analyses. In this paper we present an efficient algorithm for Gaussian mixture modeling of proteomic mass spectra of different types (e.g., MALDI-ToF profiling, MALDI-IMS). The main idea is automated partitioning of protein mass spectral signal into fragments. The obtained fragments are separately decomposed into Gaussian mixture models. The parameters of the mixture models of fragments are then aggregated to form the mixture model of the whole spectrum. We compare the elaborated algorithm to existing algorithms for peak detection and we demonstrate improvements of peak detection efficiency obtained by using Gaussian mixture modeling. We also show applications of the elaborated algorithm to real proteomic datasets of low and high resolution. Public Library of Science 2015-07-31 /pmc/articles/PMC4521892/ /pubmed/26230717 http://dx.doi.org/10.1371/journal.pone.0134256 Text en © 2015 Polanski et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Polanski, Andrzej
Marczyk, Michal
Pietrowska, Monika
Widlak, Piotr
Polanska, Joanna
Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title_full Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title_fullStr Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title_full_unstemmed Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title_short Signal Partitioning Algorithm for Highly Efficient Gaussian Mixture Modeling in Mass Spectrometry
title_sort signal partitioning algorithm for highly efficient gaussian mixture modeling in mass spectrometry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4521892/
https://www.ncbi.nlm.nih.gov/pubmed/26230717
http://dx.doi.org/10.1371/journal.pone.0134256
work_keys_str_mv AT polanskiandrzej signalpartitioningalgorithmforhighlyefficientgaussianmixturemodelinginmassspectrometry
AT marczykmichal signalpartitioningalgorithmforhighlyefficientgaussianmixturemodelinginmassspectrometry
AT pietrowskamonika signalpartitioningalgorithmforhighlyefficientgaussianmixturemodelinginmassspectrometry
AT widlakpiotr signalpartitioningalgorithmforhighlyefficientgaussianmixturemodelinginmassspectrometry
AT polanskajoanna signalpartitioningalgorithmforhighlyefficientgaussianmixturemodelinginmassspectrometry