Cargando…

TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets

Although mass spectrometry is well-suited to identifying thousands of possible protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of searching enor...

Descripción completa

Detalles Bibliográficos
Autores principales: Devabhaktuni, Arun, Lin, Sarah, Zhang, Lichao, Swaminathan, Kavya, Gonzales, Carlos, Olsson, Niclas, Pearlman, Sam, Rawson, Keith, Elias, Joshua E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6447449/
https://www.ncbi.nlm.nih.gov/pubmed/30936560
http://dx.doi.org/10.1038/s41587-019-0067-5
_version_ 1783408502571532288
author Devabhaktuni, Arun
Lin, Sarah
Zhang, Lichao
Swaminathan, Kavya
Gonzales, Carlos
Olsson, Niclas
Pearlman, Sam
Rawson, Keith
Elias, Joshua E.
author_facet Devabhaktuni, Arun
Lin, Sarah
Zhang, Lichao
Swaminathan, Kavya
Gonzales, Carlos
Olsson, Niclas
Pearlman, Sam
Rawson, Keith
Elias, Joshua E.
author_sort Devabhaktuni, Arun
collection PubMed
description Although mass spectrometry is well-suited to identifying thousands of possible protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of searching enormous search spaces and distinguishing correct from incorrect spectrum interpretations. Here, we describe TagGraph, a computational tool that overcomes both challenges with an unrestricted string-based search method that is as much as 350-fold faster than existing approaches, and a probabilistic validation model we optimized for PTM assignments. We applied TagGraph to a published human proteomic data set of 25 million mass spectra and tripled confident spectrum identifications compared its original analysis. We identified thousands of modification types on almost one million sites in the proteome. We show new contexts for highly abundant yet understudied PTMs such as proline hydroxylation, and its unexpected association with cancer mutations. By enabling broad PTM characterization TagGraph informs how their functions and regulation intersect.
format Online
Article
Text
id pubmed-6447449
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-64474492019-10-01 TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets Devabhaktuni, Arun Lin, Sarah Zhang, Lichao Swaminathan, Kavya Gonzales, Carlos Olsson, Niclas Pearlman, Sam Rawson, Keith Elias, Joshua E. Nat Biotechnol Article Although mass spectrometry is well-suited to identifying thousands of possible protein post-translational modifications (PTMs), it has historically been biased towards just a few. To measure the entire set of PTMs across diverse proteomes, software must overcome the dual challenges of searching enormous search spaces and distinguishing correct from incorrect spectrum interpretations. Here, we describe TagGraph, a computational tool that overcomes both challenges with an unrestricted string-based search method that is as much as 350-fold faster than existing approaches, and a probabilistic validation model we optimized for PTM assignments. We applied TagGraph to a published human proteomic data set of 25 million mass spectra and tripled confident spectrum identifications compared its original analysis. We identified thousands of modification types on almost one million sites in the proteome. We show new contexts for highly abundant yet understudied PTMs such as proline hydroxylation, and its unexpected association with cancer mutations. By enabling broad PTM characterization TagGraph informs how their functions and regulation intersect. 2019-04-01 2019-04 /pmc/articles/PMC6447449/ /pubmed/30936560 http://dx.doi.org/10.1038/s41587-019-0067-5 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Devabhaktuni, Arun
Lin, Sarah
Zhang, Lichao
Swaminathan, Kavya
Gonzales, Carlos
Olsson, Niclas
Pearlman, Sam
Rawson, Keith
Elias, Joshua E.
TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title_full TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title_fullStr TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title_full_unstemmed TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title_short TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
title_sort taggraph reveals vast protein modification landscapes from large tandem mass spectrometry data sets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6447449/
https://www.ncbi.nlm.nih.gov/pubmed/30936560
http://dx.doi.org/10.1038/s41587-019-0067-5
work_keys_str_mv AT devabhaktuniarun taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT linsarah taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT zhanglichao taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT swaminathankavya taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT gonzalescarlos taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT olssonniclas taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT pearlmansam taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT rawsonkeith taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets
AT eliasjoshuae taggraphrevealsvastproteinmodificationlandscapesfromlargetandemmassspectrometrydatasets