Cargando…

The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads

We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an...

Descripción completa

Detalles Bibliográficos
Autores principales: Liao, Yang, Smyth, Gordon K, Shi, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486549/
https://www.ncbi.nlm.nih.gov/pubmed/30783653
http://dx.doi.org/10.1093/nar/gkz114
_version_ 1783414358691282944
author Liao, Yang
Smyth, Gordon K
Shi, Wei
author_facet Liao, Yang
Smyth, Gordon K
Shi, Wei
author_sort Liao, Yang
collection PubMed
description We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an R object ready for downstream analysis. It integrates read mapping and quantification in a single package and has no software dependencies other than R itself. We demonstrate Rsubread’s ability to detect exon–exon junctions de novo and to quantify expression at the level of either genes, exons or exon junctions. The resulting read counts can be input directly into a wide range of downstream statistical analyses using other Bioconductor packages. Using SEQC data and simulations, we compare Rsubread to TopHat2, STAR and HTSeq as well as to counting functions in the Bioconductor infrastructure packages. We consider the performance of these tools on the combined quantification task starting from raw sequence reads through to summary counts, and in particular evaluate the performance of different combinations of alignment and counting algorithms. We show that Rsubread is faster and uses less memory than competitor tools and produces read count summaries that more accurately correlate with true values.
format Online
Article
Text
id pubmed-6486549
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64865492019-05-01 The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads Liao, Yang Smyth, Gordon K Shi, Wei Nucleic Acids Res Methods Online We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an R object ready for downstream analysis. It integrates read mapping and quantification in a single package and has no software dependencies other than R itself. We demonstrate Rsubread’s ability to detect exon–exon junctions de novo and to quantify expression at the level of either genes, exons or exon junctions. The resulting read counts can be input directly into a wide range of downstream statistical analyses using other Bioconductor packages. Using SEQC data and simulations, we compare Rsubread to TopHat2, STAR and HTSeq as well as to counting functions in the Bioconductor infrastructure packages. We consider the performance of these tools on the combined quantification task starting from raw sequence reads through to summary counts, and in particular evaluate the performance of different combinations of alignment and counting algorithms. We show that Rsubread is faster and uses less memory than competitor tools and produces read count summaries that more accurately correlate with true values. Oxford University Press 2019-05-07 2019-02-20 /pmc/articles/PMC6486549/ /pubmed/30783653 http://dx.doi.org/10.1093/nar/gkz114 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Liao, Yang
Smyth, Gordon K
Shi, Wei
The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title_full The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title_fullStr The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title_full_unstemmed The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title_short The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
title_sort r package rsubread is easier, faster, cheaper and better for alignment and quantification of rna sequencing reads
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486549/
https://www.ncbi.nlm.nih.gov/pubmed/30783653
http://dx.doi.org/10.1093/nar/gkz114
work_keys_str_mv AT liaoyang therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads
AT smythgordonk therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads
AT shiwei therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads
AT liaoyang rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads
AT smythgordonk rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads
AT shiwei rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads