Cargando…

Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells

MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yixin, Dukler, Noah, Barshad, Gilad, Toneyan, Shushan, Danko, Charles G, Siepel, Adam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665767/
https://www.ncbi.nlm.nih.gov/pubmed/34382072
http://dx.doi.org/10.1093/bioinformatics/btab582
_version_ 1784614077254860800
author Zhao, Yixin
Dukler, Noah
Barshad, Gilad
Toneyan, Shushan
Danko, Charles G
Siepel, Adam
author_facet Zhao, Yixin
Dukler, Noah
Barshad, Gilad
Toneyan, Shushan
Danko, Charles G
Siepel, Adam
author_sort Zhao, Yixin
collection PubMed
description MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-sequencing data (DENR), which models nascent RNA-sequencing read-counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical ‘shape profile’ of read-counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. AVAILABILITY AND IMPLEMENTATION: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8665767
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86657672021-12-13 Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells Zhao, Yixin Dukler, Noah Barshad, Gilad Toneyan, Shushan Danko, Charles G Siepel, Adam Bioinformatics Original Papers MOTIVATION: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. RESULTS: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-sequencing data (DENR), which models nascent RNA-sequencing read-counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical ‘shape profile’ of read-counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. AVAILABILITY AND IMPLEMENTATION: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-08-11 /pmc/articles/PMC8665767/ /pubmed/34382072 http://dx.doi.org/10.1093/bioinformatics/btab582 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Zhao, Yixin
Dukler, Noah
Barshad, Gilad
Toneyan, Shushan
Danko, Charles G
Siepel, Adam
Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title_full Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title_fullStr Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title_full_unstemmed Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title_short Deconvolution of expression for nascent RNA-sequencing data (DENR) highlights pre-RNA isoform diversity in human cells
title_sort deconvolution of expression for nascent rna-sequencing data (denr) highlights pre-rna isoform diversity in human cells
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665767/
https://www.ncbi.nlm.nih.gov/pubmed/34382072
http://dx.doi.org/10.1093/bioinformatics/btab582
work_keys_str_mv AT zhaoyixin deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells
AT duklernoah deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells
AT barshadgilad deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells
AT toneyanshushan deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells
AT dankocharlesg deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells
AT siepeladam deconvolutionofexpressionfornascentrnasequencingdatadenrhighlightsprernaisoformdiversityinhumancells