Cargando…
The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads
We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486549/ https://www.ncbi.nlm.nih.gov/pubmed/30783653 http://dx.doi.org/10.1093/nar/gkz114 |
_version_ | 1783414358691282944 |
---|---|
author | Liao, Yang Smyth, Gordon K Shi, Wei |
author_facet | Liao, Yang Smyth, Gordon K Shi, Wei |
author_sort | Liao, Yang |
collection | PubMed |
description | We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an R object ready for downstream analysis. It integrates read mapping and quantification in a single package and has no software dependencies other than R itself. We demonstrate Rsubread’s ability to detect exon–exon junctions de novo and to quantify expression at the level of either genes, exons or exon junctions. The resulting read counts can be input directly into a wide range of downstream statistical analyses using other Bioconductor packages. Using SEQC data and simulations, we compare Rsubread to TopHat2, STAR and HTSeq as well as to counting functions in the Bioconductor infrastructure packages. We consider the performance of these tools on the combined quantification task starting from raw sequence reads through to summary counts, and in particular evaluate the performance of different combinations of alignment and counting algorithms. We show that Rsubread is faster and uses less memory than competitor tools and produces read count summaries that more accurately correlate with true values. |
format | Online Article Text |
id | pubmed-6486549 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-64865492019-05-01 The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads Liao, Yang Smyth, Gordon K Shi, Wei Nucleic Acids Res Methods Online We present Rsubread, a Bioconductor software package that provides high-performance alignment and read counting functions for RNA-seq reads. Rsubread is based on the successful Subread suite with the added ease-of-use of the R programming environment, creating a matrix of read counts directly as an R object ready for downstream analysis. It integrates read mapping and quantification in a single package and has no software dependencies other than R itself. We demonstrate Rsubread’s ability to detect exon–exon junctions de novo and to quantify expression at the level of either genes, exons or exon junctions. The resulting read counts can be input directly into a wide range of downstream statistical analyses using other Bioconductor packages. Using SEQC data and simulations, we compare Rsubread to TopHat2, STAR and HTSeq as well as to counting functions in the Bioconductor infrastructure packages. We consider the performance of these tools on the combined quantification task starting from raw sequence reads through to summary counts, and in particular evaluate the performance of different combinations of alignment and counting algorithms. We show that Rsubread is faster and uses less memory than competitor tools and produces read count summaries that more accurately correlate with true values. Oxford University Press 2019-05-07 2019-02-20 /pmc/articles/PMC6486549/ /pubmed/30783653 http://dx.doi.org/10.1093/nar/gkz114 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Liao, Yang Smyth, Gordon K Shi, Wei The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title | The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title_full | The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title_fullStr | The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title_full_unstemmed | The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title_short | The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads |
title_sort | r package rsubread is easier, faster, cheaper and better for alignment and quantification of rna sequencing reads |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6486549/ https://www.ncbi.nlm.nih.gov/pubmed/30783653 http://dx.doi.org/10.1093/nar/gkz114 |
work_keys_str_mv | AT liaoyang therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads AT smythgordonk therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads AT shiwei therpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads AT liaoyang rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads AT smythgordonk rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads AT shiwei rpackagersubreadiseasierfastercheaperandbetterforalignmentandquantificationofrnasequencingreads |