Cargando…
The genetic and biochemical determinants of mRNA degradation rates in mammals
BACKGROUND: Degradation rate is a fundamental aspect of mRNA metabolism, and the factors governing it remain poorly characterized. Understanding the genetic and biochemical determinants of mRNA half-life would enable more precise identification of variants that perturb gene expression through post-t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684954/ https://www.ncbi.nlm.nih.gov/pubmed/36419176 http://dx.doi.org/10.1186/s13059-022-02811-x |
_version_ | 1784835399834664960 |
---|---|
author | Agarwal, Vikram Kelley, David R. |
author_facet | Agarwal, Vikram Kelley, David R. |
author_sort | Agarwal, Vikram |
collection | PubMed |
description | BACKGROUND: Degradation rate is a fundamental aspect of mRNA metabolism, and the factors governing it remain poorly characterized. Understanding the genetic and biochemical determinants of mRNA half-life would enable more precise identification of variants that perturb gene expression through post-transcriptional gene regulatory mechanisms. RESULTS: We establish a compendium of 39 human and 27 mouse transcriptome-wide mRNA decay rate datasets. A meta-analysis of these data identified a prevalence of technical noise and measurement bias, induced partially by the underlying experimental strategy. Correcting for these biases allowed us to derive more precise, consensus measurements of half-life which exhibit enhanced consistency between species. We trained substantially improved statistical models based upon genetic and biochemical features to better predict half-life and characterize the factors molding it. Our state-of-the-art model, Saluki, is a hybrid convolutional and recurrent deep neural network which relies only upon an mRNA sequence annotated with coding frame and splice sites to predict half-life (r=0.77). The key novel principle learned by Saluki is that the spatial positioning of splice sites, codons, and RNA-binding motifs within an mRNA is strongly associated with mRNA half-life. Saluki predicts the impact of RNA sequences and genetic mutations therein on mRNA stability, in agreement with functional measurements derived from massively parallel reporter assays. CONCLUSIONS: Our work produces a more robust ground truth for transcriptome-wide mRNA half-lives in mammalian cells. Using these revised measurements, we trained Saluki, a model that is over 50% more accurate in predicting half-life from sequence than existing models. Saluki succinctly captures many of the known determinants of mRNA half-life and can be rapidly deployed to predict the functional consequences of arbitrary mutations in the transcriptome. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02811-x. |
format | Online Article Text |
id | pubmed-9684954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-96849542022-11-25 The genetic and biochemical determinants of mRNA degradation rates in mammals Agarwal, Vikram Kelley, David R. Genome Biol Research BACKGROUND: Degradation rate is a fundamental aspect of mRNA metabolism, and the factors governing it remain poorly characterized. Understanding the genetic and biochemical determinants of mRNA half-life would enable more precise identification of variants that perturb gene expression through post-transcriptional gene regulatory mechanisms. RESULTS: We establish a compendium of 39 human and 27 mouse transcriptome-wide mRNA decay rate datasets. A meta-analysis of these data identified a prevalence of technical noise and measurement bias, induced partially by the underlying experimental strategy. Correcting for these biases allowed us to derive more precise, consensus measurements of half-life which exhibit enhanced consistency between species. We trained substantially improved statistical models based upon genetic and biochemical features to better predict half-life and characterize the factors molding it. Our state-of-the-art model, Saluki, is a hybrid convolutional and recurrent deep neural network which relies only upon an mRNA sequence annotated with coding frame and splice sites to predict half-life (r=0.77). The key novel principle learned by Saluki is that the spatial positioning of splice sites, codons, and RNA-binding motifs within an mRNA is strongly associated with mRNA half-life. Saluki predicts the impact of RNA sequences and genetic mutations therein on mRNA stability, in agreement with functional measurements derived from massively parallel reporter assays. CONCLUSIONS: Our work produces a more robust ground truth for transcriptome-wide mRNA half-lives in mammalian cells. Using these revised measurements, we trained Saluki, a model that is over 50% more accurate in predicting half-life from sequence than existing models. Saluki succinctly captures many of the known determinants of mRNA half-life and can be rapidly deployed to predict the functional consequences of arbitrary mutations in the transcriptome. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02811-x. BioMed Central 2022-11-23 /pmc/articles/PMC9684954/ /pubmed/36419176 http://dx.doi.org/10.1186/s13059-022-02811-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Agarwal, Vikram Kelley, David R. The genetic and biochemical determinants of mRNA degradation rates in mammals |
title | The genetic and biochemical determinants of mRNA degradation rates in mammals |
title_full | The genetic and biochemical determinants of mRNA degradation rates in mammals |
title_fullStr | The genetic and biochemical determinants of mRNA degradation rates in mammals |
title_full_unstemmed | The genetic and biochemical determinants of mRNA degradation rates in mammals |
title_short | The genetic and biochemical determinants of mRNA degradation rates in mammals |
title_sort | genetic and biochemical determinants of mrna degradation rates in mammals |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684954/ https://www.ncbi.nlm.nih.gov/pubmed/36419176 http://dx.doi.org/10.1186/s13059-022-02811-x |
work_keys_str_mv | AT agarwalvikram thegeneticandbiochemicaldeterminantsofmrnadegradationratesinmammals AT kelleydavidr thegeneticandbiochemicaldeterminantsofmrnadegradationratesinmammals AT agarwalvikram geneticandbiochemicaldeterminantsofmrnadegradationratesinmammals AT kelleydavidr geneticandbiochemicaldeterminantsofmrnadegradationratesinmammals |