Cargando…

How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?

RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment w...

Descripción completa

Detalles Bibliográficos
Autores principales: Schurch, Nicholas J., Schofield, Pietá, Gierliński, Marek, Cole, Christian, Sherstnev, Alexander, Singh, Vijender, Wrobel, Nicola, Gharbi, Karim, Simpson, Gordon G., Owen-Hughes, Tom, Blaxter, Mark, Barton, Geoffrey J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878611/
https://www.ncbi.nlm.nih.gov/pubmed/27022035
http://dx.doi.org/10.1261/rna.053959.115
_version_ 1782433585615077376
author Schurch, Nicholas J.
Schofield, Pietá
Gierliński, Marek
Cole, Christian
Sherstnev, Alexander
Singh, Vijender
Wrobel, Nicola
Gharbi, Karim
Simpson, Gordon G.
Owen-Hughes, Tom
Blaxter, Mark
Barton, Geoffrey J.
author_facet Schurch, Nicholas J.
Schofield, Pietá
Gierliński, Marek
Cole, Christian
Sherstnev, Alexander
Singh, Vijender
Wrobel, Nicola
Gharbi, Karim
Simpson, Gordon G.
Owen-Hughes, Tom
Blaxter, Mark
Barton, Geoffrey J.
author_sort Schurch, Nicholas J.
collection PubMed
description RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment with 48 biological replicates in each of two conditions was performed to answer these questions and provide guidelines for experimental design. With three biological replicates, nine of the 11 tools evaluated found only 20%–40% of the significantly differentially expressed (SDE) genes identified with the full set of 42 clean replicates. This rises to >85% for the subset of SDE genes changing in expression by more than fourfold. To achieve >85% for all SDE genes regardless of fold change requires more than 20 biological replicates. The same nine tools successfully control their false discovery rate at ≲5% for all numbers of replicates, while the remaining two tools fail to control their FDR adequately, particularly for low numbers of replicates. For future RNA-seq experiments, these results suggest that at least six biological replicates should be used, rising to at least 12 when it is important to identify SDE genes for all fold changes. If fewer than 12 replicates are used, a superior combination of true positive and false positive performances makes edgeR and DESeq2 the leading tools. For higher replicate numbers, minimizing false positives is more important and DESeq marginally outperforms the other tools.
format Online
Article
Text
id pubmed-4878611
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-48786112016-06-07 How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use? Schurch, Nicholas J. Schofield, Pietá Gierliński, Marek Cole, Christian Sherstnev, Alexander Singh, Vijender Wrobel, Nicola Gharbi, Karim Simpson, Gordon G. Owen-Hughes, Tom Blaxter, Mark Barton, Geoffrey J. RNA Article RNA-seq is now the technology of choice for genome-wide differential gene expression experiments, but it is not clear how many biological replicates are needed to ensure valid biological interpretation of the results or which statistical tools are best for analyzing the data. An RNA-seq experiment with 48 biological replicates in each of two conditions was performed to answer these questions and provide guidelines for experimental design. With three biological replicates, nine of the 11 tools evaluated found only 20%–40% of the significantly differentially expressed (SDE) genes identified with the full set of 42 clean replicates. This rises to >85% for the subset of SDE genes changing in expression by more than fourfold. To achieve >85% for all SDE genes regardless of fold change requires more than 20 biological replicates. The same nine tools successfully control their false discovery rate at ≲5% for all numbers of replicates, while the remaining two tools fail to control their FDR adequately, particularly for low numbers of replicates. For future RNA-seq experiments, these results suggest that at least six biological replicates should be used, rising to at least 12 when it is important to identify SDE genes for all fold changes. If fewer than 12 replicates are used, a superior combination of true positive and false positive performances makes edgeR and DESeq2 the leading tools. For higher replicate numbers, minimizing false positives is more important and DESeq marginally outperforms the other tools. Cold Spring Harbor Laboratory Press 2016-06 /pmc/articles/PMC4878611/ /pubmed/27022035 http://dx.doi.org/10.1261/rna.053959.115 Text en © 2016 Schurch et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society http://creativecommons.org/licenses/by/4.0/ This article, published in RNA, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Schurch, Nicholas J.
Schofield, Pietá
Gierliński, Marek
Cole, Christian
Sherstnev, Alexander
Singh, Vijender
Wrobel, Nicola
Gharbi, Karim
Simpson, Gordon G.
Owen-Hughes, Tom
Blaxter, Mark
Barton, Geoffrey J.
How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title_full How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title_fullStr How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title_full_unstemmed How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title_short How many biological replicates are needed in an RNA-seq experiment and which differential expression tool should you use?
title_sort how many biological replicates are needed in an rna-seq experiment and which differential expression tool should you use?
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4878611/
https://www.ncbi.nlm.nih.gov/pubmed/27022035
http://dx.doi.org/10.1261/rna.053959.115
work_keys_str_mv AT schurchnicholasj howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT schofieldpieta howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT gierlinskimarek howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT colechristian howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT sherstnevalexander howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT singhvijender howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT wrobelnicola howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT gharbikarim howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT simpsongordong howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT owenhughestom howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT blaxtermark howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse
AT bartongeoffreyj howmanybiologicalreplicatesareneededinanrnaseqexperimentandwhichdifferentialexpressiontoolshouldyouuse