
Utility of RNA-seq and GPMDB Protein Observation Frequency for Improving the Sensitivity of Protein Identification by Tandem MS

Tandem mass spectrometry (MS/MS) followed by database search is the method of choice for protein identification in proteomic studies. Database searching methods employ spectral matching algorithms and statistical models to identify and quantify proteins in a sample. In general, the...


Bibliographic Details
Main Authors: Shanmugam, Avinash K., Yocum, Anastasia K., Nesvizhskii, Alexey I.
Format: Online Article Text
Language: English
Published: American Chemical Society 2014
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4156250/
https://www.ncbi.nlm.nih.gov/pubmed/25026199
http://dx.doi.org/10.1021/pr500496p
_version_ 1782333701800067072
author Shanmugam, Avinash K.
Yocum, Anastasia K.
Nesvizhskii, Alexey I.
author_facet Shanmugam, Avinash K.
Yocum, Anastasia K.
Nesvizhskii, Alexey I.
author_sort Shanmugam, Avinash K.
collection PubMed
description Tandem mass spectrometry (MS/MS) followed by database search is the method of choice for protein identification in proteomic studies. Database searching methods employ spectral matching algorithms and statistical models to identify and quantify proteins in a sample. In general, these methods do not utilize any information other than spectral data for protein identification. However, considering the wealth of external data available for many biological systems, analysis methods can incorporate such information to improve the sensitivity of protein identification. In this study, we present a method to utilize Global Proteome Machine Database identification frequencies and RNA-seq transcript abundances to adjust the confidence scores of protein identifications. The method described is particularly useful for samples with low-to-moderate proteome coverage (i.e., <2000–3000 proteins), where we observe up to an 8% improvement in the number of proteins identified at a 1% false discovery rate.
format Online
Article
Text
id pubmed-4156250
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-4156250 2015-07-15 Utility of RNA-seq and GPMDB Protein Observation Frequency for Improving the Sensitivity of Protein Identification by Tandem MS Shanmugam, Avinash K. Yocum, Anastasia K. Nesvizhskii, Alexey I. J Proteome Res Tandem mass spectrometry (MS/MS) followed by database search is the method of choice for protein identification in proteomic studies. Database searching methods employ spectral matching algorithms and statistical models to identify and quantify proteins in a sample. In general, these methods do not utilize any information other than spectral data for protein identification. However, considering the wealth of external data available for many biological systems, analysis methods can incorporate such information to improve the sensitivity of protein identification. In this study, we present a method to utilize Global Proteome Machine Database identification frequencies and RNA-seq transcript abundances to adjust the confidence scores of protein identifications. The method described is particularly useful for samples with low-to-moderate proteome coverage (i.e., <2000–3000 proteins), where we observe up to an 8% improvement in the number of proteins identified at a 1% false discovery rate. American Chemical Society 2014-07-15 2014-09-05 /pmc/articles/PMC4156250/ /pubmed/25026199 http://dx.doi.org/10.1021/pr500496p Text en Copyright © 2014 American Chemical Society Terms of Use (http://pubs.acs.org/page/policy/authorchoice_termsofuse.html)
spellingShingle Shanmugam, Avinash K.
Yocum, Anastasia K.
Nesvizhskii, Alexey I.
Utility of RNA-seq and GPMDB Protein Observation Frequency for Improving the Sensitivity of Protein Identification by Tandem MS
title Utility of RNA-seq and GPMDB Protein Observation Frequency for Improving the Sensitivity of Protein Identification by Tandem MS
title_sort utility of rna-seq and gpmdb protein observation frequency for improving the sensitivity of protein identification by tandem ms
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4156250/
https://www.ncbi.nlm.nih.gov/pubmed/25026199
http://dx.doi.org/10.1021/pr500496p