Cargando…

Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification

Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relie...

Descripción completa

Detalles Bibliográficos
Autores principales: Klammer, Aaron A., Reynolds, Sheila M., Bilmes, Jeff A., MacCoss, Michael J., Noble, William Stafford
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2665034/
https://www.ncbi.nlm.nih.gov/pubmed/18586734
http://dx.doi.org/10.1093/bioinformatics/btn189
_version_ 1782166015094816768
author Klammer, Aaron A.
Reynolds, Sheila M.
Bilmes, Jeff A.
MacCoss, Michael J.
Noble, William Stafford
author_facet Klammer, Aaron A.
Reynolds, Sheila M.
Bilmes, Jeff A.
MacCoss, Michael J.
Noble, William Stafford
author_sort Klammer, Aaron A.
collection PubMed
description Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms. Results: We use a hybrid dynamic Bayesian network (DBN)/support vector machine (SVM) approach to address these two problems. We train a set of DBNs on high-confidence peptide-spectrum matches. These DBNs, known collectively as Riptide, comprise a probabilistic model of peptide fragmentation chemistry. Examination of the distributions learned by Riptide allows identification of new trends, such as prevalent a-ion fragmentation at peptide cleavage sites C-term to hydrophobic residues. In addition, Riptide can be used to produce likelihood scores that indicate whether a given peptide-spectrum match is correct. A vector of such scores is evaluated by an SVM, which produces a final score to be used in peptide identification. Using Riptide in this way yields improved discrimination when compared to other state-of-the-art MS/MS identification algorithms, increasing the number of positive identifications by as much as 12% at a 1% false discovery rate. Availability: Python and C source code are available upon request from the authors. The curated training sets are available at http://noble.gs.washington.edu/proj/intense/. The Graphical Model Tool Kit (GMTK) is freely available at http://ssli.ee.washington.edu/bilmes/gmtk. Contact:noble@gs.washington.edu
format Text
id pubmed-2665034
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26650342009-04-03 Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification Klammer, Aaron A. Reynolds, Sheila M. Bilmes, Jeff A. MacCoss, Michael J. Noble, William Stafford Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms. Results: We use a hybrid dynamic Bayesian network (DBN)/support vector machine (SVM) approach to address these two problems. We train a set of DBNs on high-confidence peptide-spectrum matches. These DBNs, known collectively as Riptide, comprise a probabilistic model of peptide fragmentation chemistry. Examination of the distributions learned by Riptide allows identification of new trends, such as prevalent a-ion fragmentation at peptide cleavage sites C-term to hydrophobic residues. In addition, Riptide can be used to produce likelihood scores that indicate whether a given peptide-spectrum match is correct. A vector of such scores is evaluated by an SVM, which produces a final score to be used in peptide identification. Using Riptide in this way yields improved discrimination when compared to other state-of-the-art MS/MS identification algorithms, increasing the number of positive identifications by as much as 12% at a 1% false discovery rate. Availability: Python and C source code are available upon request from the authors. The curated training sets are available at http://noble.gs.washington.edu/proj/intense/. The Graphical Model Tool Kit (GMTK) is freely available at http://ssli.ee.washington.edu/bilmes/gmtk. Contact:noble@gs.washington.edu Oxford University Press 2008-07-01 /pmc/articles/PMC2665034/ /pubmed/18586734 http://dx.doi.org/10.1093/bioinformatics/btn189 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
Klammer, Aaron A.
Reynolds, Sheila M.
Bilmes, Jeff A.
MacCoss, Michael J.
Noble, William Stafford
Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title_full Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title_fullStr Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title_full_unstemmed Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title_short Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
title_sort modeling peptide fragmentation with dynamic bayesian networks for peptide identification
topic Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2665034/
https://www.ncbi.nlm.nih.gov/pubmed/18586734
http://dx.doi.org/10.1093/bioinformatics/btn189
work_keys_str_mv AT klammeraarona modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification
AT reynoldssheilam modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification
AT bilmesjeffa modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification
AT maccossmichaelj modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification
AT noblewilliamstafford modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification