Cargando…

Faster and more accurate graphical model identification of tandem mass spectra using trellises

Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Shengjie, Halloran, John T., Bilmes, Jeff A., Noble, William S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908353/
https://www.ncbi.nlm.nih.gov/pubmed/27307634
http://dx.doi.org/10.1093/bioinformatics/btw269
_version_ 1782437666102444032
author Wang, Shengjie
Halloran, John T.
Bilmes, Jeff A.
Noble, William S.
author_facet Wang, Shengjie
Halloran, John T.
Bilmes, Jeff A.
Noble, William S.
author_sort Wang, Shengjie
collection PubMed
description Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide that is hypothesized to be responsible for generating the spectrum. This assignment is typically done by searching each spectrum against a database of peptides. To our knowledge, all existing MS/MS search engines compute scores individually between a given observed spectrum and each possible candidate peptide from the database. In this work, we use a trellis, a data structure capable of jointly representing a large set of candidate peptides, to avoid redundantly recomputing common sub-computations among different candidates. We show how trellises may be used to significantly speed up existing scoring algorithms, and we theoretically quantify the expected speedup afforded by trellises. Furthermore, we demonstrate that compact trellis representations of whole sets of peptides enables efficient discriminative learning of a dynamic Bayesian network for spectrum identification, leading to greatly improved spectrum identification accuracy. Contact: bilmes@uw.edu or william-noble@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4908353
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49083532016-06-17 Faster and more accurate graphical model identification of tandem mass spectra using trellises Wang, Shengjie Halloran, John T. Bilmes, Jeff A. Noble, William S. Bioinformatics Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Tandem mass spectrometry (MS/MS) is the dominant high throughput technology for identifying and quantifying proteins in complex biological samples. Analysis of the tens of thousands of fragmentation spectra produced by an MS/MS experiment begins by assigning to each observed spectrum the peptide that is hypothesized to be responsible for generating the spectrum. This assignment is typically done by searching each spectrum against a database of peptides. To our knowledge, all existing MS/MS search engines compute scores individually between a given observed spectrum and each possible candidate peptide from the database. In this work, we use a trellis, a data structure capable of jointly representing a large set of candidate peptides, to avoid redundantly recomputing common sub-computations among different candidates. We show how trellises may be used to significantly speed up existing scoring algorithms, and we theoretically quantify the expected speedup afforded by trellises. Furthermore, we demonstrate that compact trellis representations of whole sets of peptides enables efficient discriminative learning of a dynamic Bayesian network for spectrum identification, leading to greatly improved spectrum identification accuracy. Contact: bilmes@uw.edu or william-noble@uw.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-06-15 2016-06-11 /pmc/articles/PMC4908353/ /pubmed/27307634 http://dx.doi.org/10.1093/bioinformatics/btw269 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
Wang, Shengjie
Halloran, John T.
Bilmes, Jeff A.
Noble, William S.
Faster and more accurate graphical model identification of tandem mass spectra using trellises
title Faster and more accurate graphical model identification of tandem mass spectra using trellises
title_full Faster and more accurate graphical model identification of tandem mass spectra using trellises
title_fullStr Faster and more accurate graphical model identification of tandem mass spectra using trellises
title_full_unstemmed Faster and more accurate graphical model identification of tandem mass spectra using trellises
title_short Faster and more accurate graphical model identification of tandem mass spectra using trellises
title_sort faster and more accurate graphical model identification of tandem mass spectra using trellises
topic Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908353/
https://www.ncbi.nlm.nih.gov/pubmed/27307634
http://dx.doi.org/10.1093/bioinformatics/btw269
work_keys_str_mv AT wangshengjie fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises
AT halloranjohnt fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises
AT bilmesjeffa fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises
AT noblewilliams fasterandmoreaccurategraphicalmodelidentificationoftandemmassspectrausingtrellises