Cargando…

PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments

Publicly available RNA-seq data is routinely used for retrospective analysis to elucidate new biology. Novel transcript discovery enabled by joint analysis of large collections of RNA-seq data sets has emerged as one such analysis. Current methods for transcript discovery rely on a ‘2-Step’ approach...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Peng, Soukup, Alexandra A., Bresnick, Emery H., Dewey, Colin N., Keleş, Sündüz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7605252/
https://www.ncbi.nlm.nih.gov/pubmed/32958497
http://dx.doi.org/10.1101/gr.252445.119
_version_ 1783604273206001664
author Liu, Peng
Soukup, Alexandra A.
Bresnick, Emery H.
Dewey, Colin N.
Keleş, Sündüz
author_facet Liu, Peng
Soukup, Alexandra A.
Bresnick, Emery H.
Dewey, Colin N.
Keleş, Sündüz
author_sort Liu, Peng
collection PubMed
description Publicly available RNA-seq data is routinely used for retrospective analysis to elucidate new biology. Novel transcript discovery enabled by joint analysis of large collections of RNA-seq data sets has emerged as one such analysis. Current methods for transcript discovery rely on a ‘2-Step’ approach where the first step encompasses building transcripts from individual data sets, followed by the second step that merges predicted transcripts across data sets. To increase the power of transcript discovery from large collections of RNA-seq data sets, we developed a novel ‘1-Step’ approach named Pooling RNA-seq and Assembling Models (PRAM) that builds transcript models from pooled RNA-seq data sets. We demonstrate in a computational benchmark that 1-Step outperforms 2-Step approaches in predicting overall transcript structures and individual splice junctions, while performing competitively in detecting exonic nucleotides. Applying PRAM to 30 human ENCODE RNA-seq data sets identified unannotated transcripts with epigenetic and RAMPAGE signatures similar to those of recently annotated transcripts. In a case study, we discovered and experimentally validated new transcripts through the application of PRAM to mouse hematopoietic RNA-seq data sets. We uncovered new transcripts that share a differential expression pattern with a neighboring gene Pik3cg implicated in human hematopoietic phenotypes, and we provided evidence for the conservation of this relationship in human. PRAM is implemented as an R/Bioconductor package.
format Online
Article
Text
id pubmed-7605252
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-76052522020-11-12 PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments Liu, Peng Soukup, Alexandra A. Bresnick, Emery H. Dewey, Colin N. Keleş, Sündüz Genome Res Method Publicly available RNA-seq data is routinely used for retrospective analysis to elucidate new biology. Novel transcript discovery enabled by joint analysis of large collections of RNA-seq data sets has emerged as one such analysis. Current methods for transcript discovery rely on a ‘2-Step’ approach where the first step encompasses building transcripts from individual data sets, followed by the second step that merges predicted transcripts across data sets. To increase the power of transcript discovery from large collections of RNA-seq data sets, we developed a novel ‘1-Step’ approach named Pooling RNA-seq and Assembling Models (PRAM) that builds transcript models from pooled RNA-seq data sets. We demonstrate in a computational benchmark that 1-Step outperforms 2-Step approaches in predicting overall transcript structures and individual splice junctions, while performing competitively in detecting exonic nucleotides. Applying PRAM to 30 human ENCODE RNA-seq data sets identified unannotated transcripts with epigenetic and RAMPAGE signatures similar to those of recently annotated transcripts. In a case study, we discovered and experimentally validated new transcripts through the application of PRAM to mouse hematopoietic RNA-seq data sets. We uncovered new transcripts that share a differential expression pattern with a neighboring gene Pik3cg implicated in human hematopoietic phenotypes, and we provided evidence for the conservation of this relationship in human. PRAM is implemented as an R/Bioconductor package. Cold Spring Harbor Laboratory Press 2020-11 /pmc/articles/PMC7605252/ /pubmed/32958497 http://dx.doi.org/10.1101/gr.252445.119 Text en © 2020 Liu et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Liu, Peng
Soukup, Alexandra A.
Bresnick, Emery H.
Dewey, Colin N.
Keleş, Sündüz
PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title_full PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title_fullStr PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title_full_unstemmed PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title_short PRAM: a novel pooling approach for discovering intergenic transcripts from large-scale RNA sequencing experiments
title_sort pram: a novel pooling approach for discovering intergenic transcripts from large-scale rna sequencing experiments
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7605252/
https://www.ncbi.nlm.nih.gov/pubmed/32958497
http://dx.doi.org/10.1101/gr.252445.119
work_keys_str_mv AT liupeng pramanovelpoolingapproachfordiscoveringintergenictranscriptsfromlargescalernasequencingexperiments
AT soukupalexandraa pramanovelpoolingapproachfordiscoveringintergenictranscriptsfromlargescalernasequencingexperiments
AT bresnickemeryh pramanovelpoolingapproachfordiscoveringintergenictranscriptsfromlargescalernasequencingexperiments
AT deweycolinn pramanovelpoolingapproachfordiscoveringintergenictranscriptsfromlargescalernasequencingexperiments
AT kelessunduz pramanovelpoolingapproachfordiscoveringintergenictranscriptsfromlargescalernasequencingexperiments