Cargando…

Proteome-wide copy-number estimation from transcriptomics

Protein copy numbers constrain systems-level properties of regulatory networks, but absolute proteomic data remain scarce compared to transcriptomics obtained by RNA sequencing. We addressed this persistent gap by relating mRNA to protein statistically using best-available data from quantitative pro...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sweatt, Andrew J., Griffiths, Cameron D., Paudel, B. Bishal, Janes, Kevin A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cold Spring Harbor Laboratory 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369941/ https://www.ncbi.nlm.nih.gov/pubmed/37503057 http://dx.doi.org/10.1101/2023.07.10.548432

_version_	1785077864811462656
author	Sweatt, Andrew J. Griffiths, Cameron D. Paudel, B. Bishal Janes, Kevin A.
author_facet	Sweatt, Andrew J. Griffiths, Cameron D. Paudel, B. Bishal Janes, Kevin A.
author_sort	Sweatt, Andrew J.
collection	PubMed
description	Protein copy numbers constrain systems-level properties of regulatory networks, but absolute proteomic data remain scarce compared to transcriptomics obtained by RNA sequencing. We addressed this persistent gap by relating mRNA to protein statistically using best-available data from quantitative proteomics–transcriptomics for 4366 genes in 369 cell lines. The approach starts with a central estimate of protein copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model that links mRNAs to protein. For dozens of independent cell lines and primary prostate samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, and empirical protein-to-mRNA ratios. The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein interaction complexes, suggesting mechanistic relationships are embedded. We use the method to estimate viral-receptor abundances of CD55–CXADR from human heart transcriptomes and build 1489 systems-biology models of coxsackievirus B3 infection susceptibility. When applied to 796 RNA sequencing profiles of breast cancer from The Cancer Genome Atlas, inferred copy-number estimates collectively reclassify 26% of Luminal A and 29% of Luminal B tumors. Protein-based reassignments strongly involve a pharmacologic target for luminal breast cancer (CDK4) and an α-catenin that is often undetectable at the mRNA level (CTTNA2). Thus, by adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility limits of contemporary proteomics. The collection of gene-specific models is assembled as a web tool for users seeking mRNA-guided predictions of absolute protein abundance (http://janeslab.shinyapps.io/Pinferna).
format	Online Article Text
id	pubmed-10369941
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cold Spring Harbor Laboratory
record_format	MEDLINE/PubMed
spelling	pubmed-103699412023-07-27 Proteome-wide copy-number estimation from transcriptomics Sweatt, Andrew J. Griffiths, Cameron D. Paudel, B. Bishal Janes, Kevin A. bioRxiv Article Protein copy numbers constrain systems-level properties of regulatory networks, but absolute proteomic data remain scarce compared to transcriptomics obtained by RNA sequencing. We addressed this persistent gap by relating mRNA to protein statistically using best-available data from quantitative proteomics–transcriptomics for 4366 genes in 369 cell lines. The approach starts with a central estimate of protein copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model that links mRNAs to protein. For dozens of independent cell lines and primary prostate samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, and empirical protein-to-mRNA ratios. The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein interaction complexes, suggesting mechanistic relationships are embedded. We use the method to estimate viral-receptor abundances of CD55–CXADR from human heart transcriptomes and build 1489 systems-biology models of coxsackievirus B3 infection susceptibility. When applied to 796 RNA sequencing profiles of breast cancer from The Cancer Genome Atlas, inferred copy-number estimates collectively reclassify 26% of Luminal A and 29% of Luminal B tumors. Protein-based reassignments strongly involve a pharmacologic target for luminal breast cancer (CDK4) and an α-catenin that is often undetectable at the mRNA level (CTTNA2). Thus, by adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility limits of contemporary proteomics. The collection of gene-specific models is assembled as a web tool for users seeking mRNA-guided predictions of absolute protein abundance (http://janeslab.shinyapps.io/Pinferna). Cold Spring Harbor Laboratory 2023-07-11 /pmc/articles/PMC10369941/ /pubmed/37503057 http://dx.doi.org/10.1101/2023.07.10.548432 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle	Article Sweatt, Andrew J. Griffiths, Cameron D. Paudel, B. Bishal Janes, Kevin A. Proteome-wide copy-number estimation from transcriptomics
title	Proteome-wide copy-number estimation from transcriptomics
title_full	Proteome-wide copy-number estimation from transcriptomics
title_fullStr	Proteome-wide copy-number estimation from transcriptomics
title_full_unstemmed	Proteome-wide copy-number estimation from transcriptomics
title_short	Proteome-wide copy-number estimation from transcriptomics
title_sort	proteome-wide copy-number estimation from transcriptomics
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10369941/ https://www.ncbi.nlm.nih.gov/pubmed/37503057 http://dx.doi.org/10.1101/2023.07.10.548432
work_keys_str_mv	AT sweattandrewj proteomewidecopynumberestimationfromtranscriptomics AT griffithscamerond proteomewidecopynumberestimationfromtranscriptomics AT paudelbbishal proteomewidecopynumberestimationfromtranscriptomics AT janeskevina proteomewidecopynumberestimationfromtranscriptomics

Proteome-wide copy-number estimation from transcriptomics

Ejemplares similares