Cargando…

Flexible expressed region analysis for RNA-seq with derfinder

Differential expression analysis of RNA sequencing (RNA-seq) data typically relies on reconstructing transcripts or counting reads that overlap known gene structures. We previously introduced an intermediate statistical approach called differentially expressed region (DER) finder that seeks to ident...

Descripción completa

Detalles Bibliográficos
Autores principales: Collado-Torres, Leonardo, Nellore, Abhinav, Frazee, Alyssa C., Wilks, Christopher, Love, Michael I., Langmead, Ben, Irizarry, Rafael A., Leek, Jeffrey T., Jaffe, Andrew E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5314792/
https://www.ncbi.nlm.nih.gov/pubmed/27694310
http://dx.doi.org/10.1093/nar/gkw852
_version_ 1782508584547909632
author Collado-Torres, Leonardo
Nellore, Abhinav
Frazee, Alyssa C.
Wilks, Christopher
Love, Michael I.
Langmead, Ben
Irizarry, Rafael A.
Leek, Jeffrey T.
Jaffe, Andrew E.
author_facet Collado-Torres, Leonardo
Nellore, Abhinav
Frazee, Alyssa C.
Wilks, Christopher
Love, Michael I.
Langmead, Ben
Irizarry, Rafael A.
Leek, Jeffrey T.
Jaffe, Andrew E.
author_sort Collado-Torres, Leonardo
collection PubMed
description Differential expression analysis of RNA sequencing (RNA-seq) data typically relies on reconstructing transcripts or counting reads that overlap known gene structures. We previously introduced an intermediate statistical approach called differentially expressed region (DER) finder that seeks to identify contiguous regions of the genome showing differential expression signal at single base resolution without relying on existing annotation or potentially inaccurate transcript assembly. We present the derfinder software that improves our annotation-agnostic approach to RNA-seq analysis by: (i) implementing a computationally efficient bump-hunting approach to identify DERs that permits genome-scale analyses in a large number of samples, (ii) introducing a flexible statistical modeling framework, including multi-group and time-course analyses and (iii) introducing a new set of data visualizations for expressed region analysis. We apply this approach to public RNA-seq data from the Genotype-Tissue Expression (GTEx) project and BrainSpan project to show that derfinder permits the analysis of hundreds of samples at base resolution in R, identifies expression outside of known gene boundaries and can be used to visualize expressed regions at base-resolution. In simulations, our base resolution approaches enable discovery in the presence of incomplete annotation and is nearly as powerful as feature-level methods when the annotation is complete. derfinder analysis using expressed region-level and single base-level approaches provides a compromise between full transcript reconstruction and feature-level analysis. The package is available from Bioconductor at www.bioconductor.org/packages/derfinder.
format Online
Article
Text
id pubmed-5314792
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-53147922017-02-21 Flexible expressed region analysis for RNA-seq with derfinder Collado-Torres, Leonardo Nellore, Abhinav Frazee, Alyssa C. Wilks, Christopher Love, Michael I. Langmead, Ben Irizarry, Rafael A. Leek, Jeffrey T. Jaffe, Andrew E. Nucleic Acids Res Methods Online Differential expression analysis of RNA sequencing (RNA-seq) data typically relies on reconstructing transcripts or counting reads that overlap known gene structures. We previously introduced an intermediate statistical approach called differentially expressed region (DER) finder that seeks to identify contiguous regions of the genome showing differential expression signal at single base resolution without relying on existing annotation or potentially inaccurate transcript assembly. We present the derfinder software that improves our annotation-agnostic approach to RNA-seq analysis by: (i) implementing a computationally efficient bump-hunting approach to identify DERs that permits genome-scale analyses in a large number of samples, (ii) introducing a flexible statistical modeling framework, including multi-group and time-course analyses and (iii) introducing a new set of data visualizations for expressed region analysis. We apply this approach to public RNA-seq data from the Genotype-Tissue Expression (GTEx) project and BrainSpan project to show that derfinder permits the analysis of hundreds of samples at base resolution in R, identifies expression outside of known gene boundaries and can be used to visualize expressed regions at base-resolution. In simulations, our base resolution approaches enable discovery in the presence of incomplete annotation and is nearly as powerful as feature-level methods when the annotation is complete. derfinder analysis using expressed region-level and single base-level approaches provides a compromise between full transcript reconstruction and feature-level analysis. The package is available from Bioconductor at www.bioconductor.org/packages/derfinder. Oxford University Press 2017-01-25 2016-09-29 /pmc/articles/PMC5314792/ /pubmed/27694310 http://dx.doi.org/10.1093/nar/gkw852 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Collado-Torres, Leonardo
Nellore, Abhinav
Frazee, Alyssa C.
Wilks, Christopher
Love, Michael I.
Langmead, Ben
Irizarry, Rafael A.
Leek, Jeffrey T.
Jaffe, Andrew E.
Flexible expressed region analysis for RNA-seq with derfinder
title Flexible expressed region analysis for RNA-seq with derfinder
title_full Flexible expressed region analysis for RNA-seq with derfinder
title_fullStr Flexible expressed region analysis for RNA-seq with derfinder
title_full_unstemmed Flexible expressed region analysis for RNA-seq with derfinder
title_short Flexible expressed region analysis for RNA-seq with derfinder
title_sort flexible expressed region analysis for rna-seq with derfinder
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5314792/
https://www.ncbi.nlm.nih.gov/pubmed/27694310
http://dx.doi.org/10.1093/nar/gkw852
work_keys_str_mv AT colladotorresleonardo flexibleexpressedregionanalysisforrnaseqwithderfinder
AT nelloreabhinav flexibleexpressedregionanalysisforrnaseqwithderfinder
AT frazeealyssac flexibleexpressedregionanalysisforrnaseqwithderfinder
AT wilkschristopher flexibleexpressedregionanalysisforrnaseqwithderfinder
AT lovemichaeli flexibleexpressedregionanalysisforrnaseqwithderfinder
AT langmeadben flexibleexpressedregionanalysisforrnaseqwithderfinder
AT irizarryrafaela flexibleexpressedregionanalysisforrnaseqwithderfinder
AT leekjeffreyt flexibleexpressedregionanalysisforrnaseqwithderfinder
AT jaffeandrewe flexibleexpressedregionanalysisforrnaseqwithderfinder