Cargando…
Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification
Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relie...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2665034/ https://www.ncbi.nlm.nih.gov/pubmed/18586734 http://dx.doi.org/10.1093/bioinformatics/btn189 |
_version_ | 1782166015094816768 |
---|---|
author | Klammer, Aaron A. Reynolds, Sheila M. Bilmes, Jeff A. MacCoss, Michael J. Noble, William Stafford |
author_facet | Klammer, Aaron A. Reynolds, Sheila M. Bilmes, Jeff A. MacCoss, Michael J. Noble, William Stafford |
author_sort | Klammer, Aaron A. |
collection | PubMed |
description | Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms. Results: We use a hybrid dynamic Bayesian network (DBN)/support vector machine (SVM) approach to address these two problems. We train a set of DBNs on high-confidence peptide-spectrum matches. These DBNs, known collectively as Riptide, comprise a probabilistic model of peptide fragmentation chemistry. Examination of the distributions learned by Riptide allows identification of new trends, such as prevalent a-ion fragmentation at peptide cleavage sites C-term to hydrophobic residues. In addition, Riptide can be used to produce likelihood scores that indicate whether a given peptide-spectrum match is correct. A vector of such scores is evaluated by an SVM, which produces a final score to be used in peptide identification. Using Riptide in this way yields improved discrimination when compared to other state-of-the-art MS/MS identification algorithms, increasing the number of positive identifications by as much as 12% at a 1% false discovery rate. Availability: Python and C source code are available upon request from the authors. The curated training sets are available at http://noble.gs.washington.edu/proj/intense/. The Graphical Model Tool Kit (GMTK) is freely available at http://ssli.ee.washington.edu/bilmes/gmtk. Contact:noble@gs.washington.edu |
format | Text |
id | pubmed-2665034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-26650342009-04-03 Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification Klammer, Aaron A. Reynolds, Sheila M. Bilmes, Jeff A. MacCoss, Michael J. Noble, William Stafford Bioinformatics Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Motivation: Tandem mass spectrometry (MS/MS) is an indispensable technology for identification of proteins from complex mixtures. Proteins are digested to peptides that are then identified by their fragmentation patterns in the mass spectrometer. Thus, at its core, MS/MS protein identification relies on the relative predictability of peptide fragmentation. Unfortunately, peptide fragmentation is complex and not fully understood, and what is understood is not always exploited by peptide identification algorithms. Results: We use a hybrid dynamic Bayesian network (DBN)/support vector machine (SVM) approach to address these two problems. We train a set of DBNs on high-confidence peptide-spectrum matches. These DBNs, known collectively as Riptide, comprise a probabilistic model of peptide fragmentation chemistry. Examination of the distributions learned by Riptide allows identification of new trends, such as prevalent a-ion fragmentation at peptide cleavage sites C-term to hydrophobic residues. In addition, Riptide can be used to produce likelihood scores that indicate whether a given peptide-spectrum match is correct. A vector of such scores is evaluated by an SVM, which produces a final score to be used in peptide identification. Using Riptide in this way yields improved discrimination when compared to other state-of-the-art MS/MS identification algorithms, increasing the number of positive identifications by as much as 12% at a 1% false discovery rate. Availability: Python and C source code are available upon request from the authors. The curated training sets are available at http://noble.gs.washington.edu/proj/intense/. The Graphical Model Tool Kit (GMTK) is freely available at http://ssli.ee.washington.edu/bilmes/gmtk. Contact:noble@gs.washington.edu Oxford University Press 2008-07-01 /pmc/articles/PMC2665034/ /pubmed/18586734 http://dx.doi.org/10.1093/bioinformatics/btn189 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto Klammer, Aaron A. Reynolds, Sheila M. Bilmes, Jeff A. MacCoss, Michael J. Noble, William Stafford Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title | Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title_full | Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title_fullStr | Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title_full_unstemmed | Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title_short | Modeling peptide fragmentation with dynamic Bayesian networks for peptide identification |
title_sort | modeling peptide fragmentation with dynamic bayesian networks for peptide identification |
topic | Ismb 2008 Conference Proceedings 19–23 July 2008, Toronto |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2665034/ https://www.ncbi.nlm.nih.gov/pubmed/18586734 http://dx.doi.org/10.1093/bioinformatics/btn189 |
work_keys_str_mv | AT klammeraarona modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification AT reynoldssheilam modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification AT bilmesjeffa modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification AT maccossmichaelj modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification AT noblewilliamstafford modelingpeptidefragmentationwithdynamicbayesiannetworksforpeptideidentification |