Cargando…
Faster and more accurate graphical model identification of tandem mass spectra using trellises
Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide tha...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908353/ https://www.ncbi.nlm.nih.gov/pubmed/27307634 http://dx.doi.org/10.1093/bioinformatics/btw269 |
_version_ | 1782437666102444032 |
---|---|
author | Wang, Shengjie Halloran, John T. Bilmes, Jeff A. Noble, William S. |
author_facet | Wang, Shengjie Halloran, John T. Bilmes, Jeff A. Noble, William S. |
author_sort | Wang, Shengjie |
collection | PubMed |
description | Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide that is hypothesized to be responsible for generating the spectrum. This assignment is typically done by searching each spectrum against a database of peptides. To our knowledge, all existing MS/MS search engines compute scores individually between a given observed spectrum and each possible candidate peptide from the database. In this work, we use a trellis, a data structure capable of jointly representing a large set of candidate peptides, to avoid redundantly recomputing common sub-computations among different candidates. We show how trellises may be used to significantly speed up existing scoring algorithms, and we theoretically quantify the expected speedup afforded by trellises. Furthermore, we demonstrate that compact trellis representations of whole sets of peptides enables efficient discriminative learning of a dynamic Bayesian network for spectrum identification, leading to greatly improved spectrum identification accuracy. Contact: bilmes@uw.edu or william-noble@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-4908353 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-49083532016-06-17 Faster and more accurate graphical model identification of tandem mass spectra using trellises Wang, Shengjie Halloran, John T. Bilmes, Jeff A. Noble, William S. Bioinformatics Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide that is hypothesized to be responsible for generating the spectrum. This assignment is typically done by searching each spectrum against a database of peptides. To our knowledge, all existing MS/MS search engines compute scores individually between a given observed spectrum and each possible candidate peptide from the database. In this work, we use a trellis, a data structure capable of jointly representing a large set of candidate peptides, to avoid redundantly recomputing common sub-computations among different candidates. We show how trellises may be used to significantly speed up existing scoring algorithms, and we theoretically quantify the expected speedup afforded by trellises. Furthermore, we demonstrate that compact trellis representations of whole sets of peptides enables efficient discriminative learning of a dynamic Bayesian network for spectrum identification, leading to greatly improved spectrum identification accuracy. Contact: bilmes@uw.edu or william-noble@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-06-15 2016-06-11 /pmc/articles/PMC4908353/ /pubmed/27307634 http://dx.doi.org/10.1093/bioinformatics/btw269 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Wang, Shengjie Halloran, John T. Bilmes, Jeff A. Noble, William S. Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title | Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title_full | Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title_fullStr | Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title_full_unstemmed | Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title_short | Faster and more accurate graphical model identification of tandem mass spectra using trellises |
title_sort | faster and more accurate graphical model identification of tandem mass spectra using trellises |
topic | Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908353/ https://www.ncbi.nlm.nih.gov/pubmed/27307634 http://dx.doi.org/10.1093/bioinformatics/btw269 |
work_keys_str_mv | AT wangshengjie fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises AT halloranjohnt fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises AT bilmesjeffa fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises AT noblewilliams fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises |