Cargando…

System wide analyses have underestimated protein abundances and the importance of transcription in mammals

Large scale surveys in mammalian tissue culture cells suggest that the protein expressed at the median abundance is present at 8,000–16,000 molecules per cell and that differences in mRNA expression between genes explain only 10–40% of the differences in protein levels. We find, however, that these...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jingyi Jessica, Bickel, Peter J, Biggin, Mark D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940484/
https://www.ncbi.nlm.nih.gov/pubmed/24688849
http://dx.doi.org/10.7717/peerj.270
_version_ 1782305795493330944
author Li, Jingyi Jessica
Bickel, Peter J
Biggin, Mark D
author_facet Li, Jingyi Jessica
Bickel, Peter J
Biggin, Mark D
author_sort Li, Jingyi Jessica
collection PubMed
description Large scale surveys in mammalian tissue culture cells suggest that the protein expressed at the median abundance is present at 8,000–16,000 molecules per cell and that differences in mRNA expression between genes explain only 10–40% of the differences in protein levels. We find, however, that these surveys have significantly underestimated protein abundances and the relative importance of transcription. Using individual measurements for 61 housekeeping proteins to rescale whole proteome data from Schwanhausser et al. (2011), we find that the median protein detected is expressed at 170,000 molecules per cell and that our corrected protein abundance estimates show a higher correlation with mRNA abundances than do the uncorrected protein data. In addition, we estimated the impact of further errors in mRNA and protein abundances using direct experimental measurements of these errors. The resulting analysis suggests that mRNA levels explain at least 56% of the differences in protein abundance for the 4,212 genes detected by Schwanhausser et al. (2011), though because one major source of error could not be estimated the true percent contribution should be higher. We also employed a second, independent strategy to determine the contribution of mRNA levels to protein expression. We show that the variance in translation rates directly measured by ribosome profiling is only 12% of that inferred by Schwanhausser et al. (2011), and that the measured and inferred translation rates correlate poorly (R(2) = 0.13). Based on this, our second strategy suggests that mRNA levels explain ∼81% of the variance in protein levels. We also determined the percent contributions of transcription, RNA degradation, translation and protein degradation to the variance in protein abundances using both of our strategies. While the magnitudes of the two estimates vary, they both suggest that transcription plays a more important role than the earlier studies implied and translation a much smaller role. Finally, the above estimates only apply to those genes whose mRNA and protein expression was detected. Based on a detailed analysis by Hebenstreit et al. (2012), we estimate that approximately 40% of genes in a given cell within a population express no mRNA. Since there can be no translation in the absence of mRNA, we argue that differences in translation rates can play no role in determining the expression levels for the ∼40% of genes that are non-expressed.
format Online
Article
Text
id pubmed-3940484
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-39404842014-03-31 System wide analyses have underestimated protein abundances and the importance of transcription in mammals Li, Jingyi Jessica Bickel, Peter J Biggin, Mark D PeerJ Bioinformatics Large scale surveys in mammalian tissue culture cells suggest that the protein expressed at the median abundance is present at 8,000–16,000 molecules per cell and that differences in mRNA expression between genes explain only 10–40% of the differences in protein levels. We find, however, that these surveys have significantly underestimated protein abundances and the relative importance of transcription. Using individual measurements for 61 housekeeping proteins to rescale whole proteome data from Schwanhausser et al. (2011), we find that the median protein detected is expressed at 170,000 molecules per cell and that our corrected protein abundance estimates show a higher correlation with mRNA abundances than do the uncorrected protein data. In addition, we estimated the impact of further errors in mRNA and protein abundances using direct experimental measurements of these errors. The resulting analysis suggests that mRNA levels explain at least 56% of the differences in protein abundance for the 4,212 genes detected by Schwanhausser et al. (2011), though because one major source of error could not be estimated the true percent contribution should be higher. We also employed a second, independent strategy to determine the contribution of mRNA levels to protein expression. We show that the variance in translation rates directly measured by ribosome profiling is only 12% of that inferred by Schwanhausser et al. (2011), and that the measured and inferred translation rates correlate poorly (R(2) = 0.13). Based on this, our second strategy suggests that mRNA levels explain ∼81% of the variance in protein levels. We also determined the percent contributions of transcription, RNA degradation, translation and protein degradation to the variance in protein abundances using both of our strategies. While the magnitudes of the two estimates vary, they both suggest that transcription plays a more important role than the earlier studies implied and translation a much smaller role. Finally, the above estimates only apply to those genes whose mRNA and protein expression was detected. Based on a detailed analysis by Hebenstreit et al. (2012), we estimate that approximately 40% of genes in a given cell within a population express no mRNA. Since there can be no translation in the absence of mRNA, we argue that differences in translation rates can play no role in determining the expression levels for the ∼40% of genes that are non-expressed. PeerJ Inc. 2014-02-27 /pmc/articles/PMC3940484/ /pubmed/24688849 http://dx.doi.org/10.7717/peerj.270 Text en http://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, made available under the Creative Commons Public Domain Dedication (http://creativecommons.org/publicdomain/zero/1.0/) . This work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Bioinformatics
Li, Jingyi Jessica
Bickel, Peter J
Biggin, Mark D
System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title_full System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title_fullStr System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title_full_unstemmed System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title_short System wide analyses have underestimated protein abundances and the importance of transcription in mammals
title_sort system wide analyses have underestimated protein abundances and the importance of transcription in mammals
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940484/
https://www.ncbi.nlm.nih.gov/pubmed/24688849
http://dx.doi.org/10.7717/peerj.270
work_keys_str_mv AT lijingyijessica systemwideanalyseshaveunderestimatedproteinabundancesandtheimportanceoftranscriptioninmammals
AT bickelpeterj systemwideanalyseshaveunderestimatedproteinabundancesandtheimportanceoftranscriptioninmammals
AT bigginmarkd systemwideanalyseshaveunderestimatedproteinabundancesandtheimportanceoftranscriptioninmammals