Cargando…

RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods

RNA sequencing (RNAseq) has become the method of choice for transcriptome analysis, yet no consensus exists as to the most appropriate pipeline for its analysis, with current benchmarks suffering important limitations. Here, we address these challenges through a rich benchmarking resource harnessing...

Descripción completa

Detalles Bibliográficos
Autores principales: Germain, Pierre-Luc, Vitriolo, Alessandro, Adamo, Antonio, Laise, Pasquale, Das, Vivek, Testa, Giuseppe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914128/
https://www.ncbi.nlm.nih.gov/pubmed/27190234
http://dx.doi.org/10.1093/nar/gkw448
_version_ 1782438516192444416
author Germain, Pierre-Luc
Vitriolo, Alessandro
Adamo, Antonio
Laise, Pasquale
Das, Vivek
Testa, Giuseppe
author_facet Germain, Pierre-Luc
Vitriolo, Alessandro
Adamo, Antonio
Laise, Pasquale
Das, Vivek
Testa, Giuseppe
author_sort Germain, Pierre-Luc
collection PubMed
description RNA sequencing (RNAseq) has become the method of choice for transcriptome analysis, yet no consensus exists as to the most appropriate pipeline for its analysis, with current benchmarks suffering important limitations. Here, we address these challenges through a rich benchmarking resource harnessing (i) two RNAseq datasets including ERCC ExFold spike-ins; (ii) Nanostring measurements of a panel of 150 genes on the same samples; (iii) a set of internal, genetically-determined controls; (iv) a reanalysis of the SEQC dataset; and (v) a focus on relative quantification (i.e. across-samples). We use this resource to compare different approaches to each step of RNAseq analysis, from alignment to differential expression testing. We show that methods providing the best absolute quantification do not necessarily provide good relative quantification across samples, that count-based methods are superior for gene-level relative quantification, and that the new generation of pseudo-alignment-based software performs as well as established methods, at a fraction of the computing time. We also assess the impact of library type and size on quantification and differential expression analysis. Finally, we have created a R package and a web platform to enable the simple and streamlined application of this resource to the benchmarking of future methods.
format Online
Article
Text
id pubmed-4914128
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49141282016-06-22 RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods Germain, Pierre-Luc Vitriolo, Alessandro Adamo, Antonio Laise, Pasquale Das, Vivek Testa, Giuseppe Nucleic Acids Res Computational Biology RNA sequencing (RNAseq) has become the method of choice for transcriptome analysis, yet no consensus exists as to the most appropriate pipeline for its analysis, with current benchmarks suffering important limitations. Here, we address these challenges through a rich benchmarking resource harnessing (i) two RNAseq datasets including ERCC ExFold spike-ins; (ii) Nanostring measurements of a panel of 150 genes on the same samples; (iii) a set of internal, genetically-determined controls; (iv) a reanalysis of the SEQC dataset; and (v) a focus on relative quantification (i.e. across-samples). We use this resource to compare different approaches to each step of RNAseq analysis, from alignment to differential expression testing. We show that methods providing the best absolute quantification do not necessarily provide good relative quantification across samples, that count-based methods are superior for gene-level relative quantification, and that the new generation of pseudo-alignment-based software performs as well as established methods, at a fraction of the computing time. We also assess the impact of library type and size on quantification and differential expression analysis. Finally, we have created a R package and a web platform to enable the simple and streamlined application of this resource to the benchmarking of future methods. Oxford University Press 2016-06-20 2016-05-17 /pmc/articles/PMC4914128/ /pubmed/27190234 http://dx.doi.org/10.1093/nar/gkw448 Text en © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Germain, Pierre-Luc
Vitriolo, Alessandro
Adamo, Antonio
Laise, Pasquale
Das, Vivek
Testa, Giuseppe
RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title_full RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title_fullStr RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title_full_unstemmed RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title_short RNAontheBENCH: computational and empirical resources for benchmarking RNAseq quantification and differential expression methods
title_sort rnaonthebench: computational and empirical resources for benchmarking rnaseq quantification and differential expression methods
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4914128/
https://www.ncbi.nlm.nih.gov/pubmed/27190234
http://dx.doi.org/10.1093/nar/gkw448
work_keys_str_mv AT germainpierreluc rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods
AT vitrioloalessandro rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods
AT adamoantonio rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods
AT laisepasquale rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods
AT dasvivek rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods
AT testagiuseppe rnaonthebenchcomputationalandempiricalresourcesforbenchmarkingrnaseqquantificationanddifferentialexpressionmethods