Cargando…

ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity

BACKGROUND: The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to h...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Shenghui, Wang, Yaojun, Bu, Dongbo, Zhang, Hong, Sun, Shiwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179969/
https://www.ncbi.nlm.nih.gov/pubmed/21849060
http://dx.doi.org/10.1186/1471-2105-12-346
_version_ 1782212578710126592
author Zhang, Shenghui
Wang, Yaojun
Bu, Dongbo
Zhang, Hong
Sun, Shiwei
author_facet Zhang, Shenghui
Wang, Yaojun
Bu, Dongbo
Zhang, Hong
Sun, Shiwei
author_sort Zhang, Shenghui
collection PubMed
description BACKGROUND: The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error. RESULTS: In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving de novo identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra). CONCLUSIONS: We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results.
format Online
Article
Text
id pubmed-3179969
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31799692011-09-27 ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity Zhang, Shenghui Wang, Yaojun Bu, Dongbo Zhang, Hong Sun, Shiwei BMC Bioinformatics Research Article BACKGROUND: The analysis of mass spectra suggests that the existence of derivative peaks is strongly dependent on the intensity of the primary peaks. Peak selection from tandem mass spectrum is used to filter out noise and contaminant peaks. It is widely accepted that a valid primary peak tends to have high intensity and is accompanied by derivative peaks, including isotopic peaks, neutral loss peaks, and complementary peaks. Existing models for peak selection ignore the dependence between the existence of the derivative peaks and the intensity of the primary peaks. Simple models for peak selection assume that these two attributes are independent; however, this assumption is contrary to real data and prone to error. RESULTS: In this paper, we present a statistical model to quantitatively measure the dependence of the derivative peak's existence on the primary peak's intensity. Here, we propose a statistical model, named ProbPS, to capture the dependence in a quantitative manner and describe a statistical model for peak selection. Our results show that the quantitative understanding can successfully guide the peak selection process. By comparing ProbPS with AuDeNS we demonstrate the advantages of our method in both filtering out noise peaks and in improving de novo identification. In addition, we present a tag identification approach based on our peak selection method. Our results, using a test data set, suggest that our tag identification method (876 correct tags in 1000 spectra) outperforms PepNovoTag (790 correct tags in 1000 spectra). CONCLUSIONS: We have shown that ProbPS improves the accuracy of peak selection which further enhances the performance of de novo sequencing and tag identification. Thus, our model saves valuable computation time and improving the accuracy of the results. BioMed Central 2011-08-17 /pmc/articles/PMC3179969/ /pubmed/21849060 http://dx.doi.org/10.1186/1471-2105-12-346 Text en Copyright ©2011 Zhang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Shenghui
Wang, Yaojun
Bu, Dongbo
Zhang, Hong
Sun, Shiwei
ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_full ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_fullStr ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_full_unstemmed ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_short ProbPS: A new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
title_sort probps: a new model for peak selection based on quantifying the dependence of the existence of derivative peaks on primary ion intensity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3179969/
https://www.ncbi.nlm.nih.gov/pubmed/21849060
http://dx.doi.org/10.1186/1471-2105-12-346
work_keys_str_mv AT zhangshenghui probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT wangyaojun probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT budongbo probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT zhanghong probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity
AT sunshiwei probpsanewmodelforpeakselectionbasedonquantifyingthedependenceoftheexistenceofderivativepeaksonprimaryionintensity