Cargando…

A comparison of methods for differential expression analysis of RNA-seq data

BACKGROUND: Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and mor...

Descripción completa

Detalles Bibliográficos
Autores principales: Soneson, Charlotte, Delorenzi, Mauro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608160/
https://www.ncbi.nlm.nih.gov/pubmed/23497356
http://dx.doi.org/10.1186/1471-2105-14-91
_version_ 1782264196048617472
author Soneson, Charlotte
Delorenzi, Mauro
author_facet Soneson, Charlotte
Delorenzi, Mauro
author_sort Soneson, Charlotte
collection PubMed
description BACKGROUND: Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and more recently high-throughput sequencing of cDNA (RNA-seq) has emerged as a powerful competitor. As the cost of sequencing decreases, it is conceivable that the use of RNA-seq for differential expression analysis will increase rapidly. To exploit the possibilities and address the challenges posed by this relatively new type of data, a number of software packages have been developed especially for differential expression analysis of RNA-seq data. RESULTS: We conducted an extensive comparison of eleven methods for differential expression analysis of RNA-seq data. All methods are freely available within the R framework and take as input a matrix of counts, i.e. the number of reads mapping to each genomic feature of interest in each of a number of samples. We evaluate the methods based on both simulated data and real RNA-seq data. CONCLUSIONS: Very small sample sizes, which are still common in RNA-seq experiments, impose problems for all evaluated methods and any results obtained under such conditions should be interpreted with caution. For larger sample sizes, the methods combining a variance-stabilizing transformation with the ‘limma’ method for differential expression analysis perform well under many different conditions, as does the nonparametric SAMseq method.
format Online
Article
Text
id pubmed-3608160
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36081602013-03-29 A comparison of methods for differential expression analysis of RNA-seq data Soneson, Charlotte Delorenzi, Mauro BMC Bioinformatics Research Article BACKGROUND: Finding genes that are differentially expressed between conditions is an integral part of understanding the molecular basis of phenotypic variation. In the past decades, DNA microarrays have been used extensively to quantify the abundance of mRNA corresponding to different genes, and more recently high-throughput sequencing of cDNA (RNA-seq) has emerged as a powerful competitor. As the cost of sequencing decreases, it is conceivable that the use of RNA-seq for differential expression analysis will increase rapidly. To exploit the possibilities and address the challenges posed by this relatively new type of data, a number of software packages have been developed especially for differential expression analysis of RNA-seq data. RESULTS: We conducted an extensive comparison of eleven methods for differential expression analysis of RNA-seq data. All methods are freely available within the R framework and take as input a matrix of counts, i.e. the number of reads mapping to each genomic feature of interest in each of a number of samples. We evaluate the methods based on both simulated data and real RNA-seq data. CONCLUSIONS: Very small sample sizes, which are still common in RNA-seq experiments, impose problems for all evaluated methods and any results obtained under such conditions should be interpreted with caution. For larger sample sizes, the methods combining a variance-stabilizing transformation with the ‘limma’ method for differential expression analysis perform well under many different conditions, as does the nonparametric SAMseq method. BioMed Central 2013-03-09 /pmc/articles/PMC3608160/ /pubmed/23497356 http://dx.doi.org/10.1186/1471-2105-14-91 Text en Copyright ©2013 Soneson and Delorenzi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Soneson, Charlotte
Delorenzi, Mauro
A comparison of methods for differential expression analysis of RNA-seq data
title A comparison of methods for differential expression analysis of RNA-seq data
title_full A comparison of methods for differential expression analysis of RNA-seq data
title_fullStr A comparison of methods for differential expression analysis of RNA-seq data
title_full_unstemmed A comparison of methods for differential expression analysis of RNA-seq data
title_short A comparison of methods for differential expression analysis of RNA-seq data
title_sort comparison of methods for differential expression analysis of rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3608160/
https://www.ncbi.nlm.nih.gov/pubmed/23497356
http://dx.doi.org/10.1186/1471-2105-14-91
work_keys_str_mv AT sonesoncharlotte acomparisonofmethodsfordifferentialexpressionanalysisofrnaseqdata
AT delorenzimauro acomparisonofmethodsfordifferentialexpressionanalysisofrnaseqdata
AT sonesoncharlotte comparisonofmethodsfordifferentialexpressionanalysisofrnaseqdata
AT delorenzimauro comparisonofmethodsfordifferentialexpressionanalysisofrnaseqdata