Cargando…
A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data
Recent advances in next-generation sequencing technology allow high-throughput cDNA sequencing (RNA-Seq) to be widely applied in transcriptomic studies, in particular for detecting differentially expressed genes between groups. Many software packages have been developed for the identification of dif...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4132098/ https://www.ncbi.nlm.nih.gov/pubmed/25119138 http://dx.doi.org/10.1371/journal.pone.0103207 |
_version_ | 1782330568214577152 |
---|---|
author | Zhang, Zong Hong Jhaveri, Dhanisha J. Marshall, Vikki M. Bauer, Denis C. Edson, Janette Narayanan, Ramesh K. Robinson, Gregory J. Lundberg, Andreas E. Bartlett, Perry F. Wray, Naomi R. Zhao, Qiong-Yi |
author_facet | Zhang, Zong Hong Jhaveri, Dhanisha J. Marshall, Vikki M. Bauer, Denis C. Edson, Janette Narayanan, Ramesh K. Robinson, Gregory J. Lundberg, Andreas E. Bartlett, Perry F. Wray, Naomi R. Zhao, Qiong-Yi |
author_sort | Zhang, Zong Hong |
collection | PubMed |
description | Recent advances in next-generation sequencing technology allow high-throughput cDNA sequencing (RNA-Seq) to be widely applied in transcriptomic studies, in particular for detecting differentially expressed genes between groups. Many software packages have been developed for the identification of differentially expressed genes (DEGs) between treatment groups based on RNA-Seq data. However, there is a lack of consensus on how to approach an optimal study design and choice of suitable software for the analysis. In this comparative study we evaluate the performance of three of the most frequently used software tools: Cufflinks-Cuffdiff2, DESeq and edgeR. A number of important parameters of RNA-Seq technology were taken into consideration, including the number of replicates, sequencing depth, and balanced vs. unbalanced sequencing depth within and between groups. We benchmarked results relative to sets of DEGs identified through either quantitative RT-PCR or microarray. We observed that edgeR performs slightly better than DESeq and Cuffdiff2 in terms of the ability to uncover true positives. Overall, DESeq or taking the intersection of DEGs from two or more tools is recommended if the number of false positives is a major concern in the study. In other circumstances, edgeR is slightly preferable for differential expression analysis at the expense of potentially introducing more false positives. |
format | Online Article Text |
id | pubmed-4132098 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41320982014-08-19 A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data Zhang, Zong Hong Jhaveri, Dhanisha J. Marshall, Vikki M. Bauer, Denis C. Edson, Janette Narayanan, Ramesh K. Robinson, Gregory J. Lundberg, Andreas E. Bartlett, Perry F. Wray, Naomi R. Zhao, Qiong-Yi PLoS One Research Article Recent advances in next-generation sequencing technology allow high-throughput cDNA sequencing (RNA-Seq) to be widely applied in transcriptomic studies, in particular for detecting differentially expressed genes between groups. Many software packages have been developed for the identification of differentially expressed genes (DEGs) between treatment groups based on RNA-Seq data. However, there is a lack of consensus on how to approach an optimal study design and choice of suitable software for the analysis. In this comparative study we evaluate the performance of three of the most frequently used software tools: Cufflinks-Cuffdiff2, DESeq and edgeR. A number of important parameters of RNA-Seq technology were taken into consideration, including the number of replicates, sequencing depth, and balanced vs. unbalanced sequencing depth within and between groups. We benchmarked results relative to sets of DEGs identified through either quantitative RT-PCR or microarray. We observed that edgeR performs slightly better than DESeq and Cuffdiff2 in terms of the ability to uncover true positives. Overall, DESeq or taking the intersection of DEGs from two or more tools is recommended if the number of false positives is a major concern in the study. In other circumstances, edgeR is slightly preferable for differential expression analysis at the expense of potentially introducing more false positives. Public Library of Science 2014-08-13 /pmc/articles/PMC4132098/ /pubmed/25119138 http://dx.doi.org/10.1371/journal.pone.0103207 Text en © 2014 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zhang, Zong Hong Jhaveri, Dhanisha J. Marshall, Vikki M. Bauer, Denis C. Edson, Janette Narayanan, Ramesh K. Robinson, Gregory J. Lundberg, Andreas E. Bartlett, Perry F. Wray, Naomi R. Zhao, Qiong-Yi A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title | A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title_full | A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title_fullStr | A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title_full_unstemmed | A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title_short | A Comparative Study of Techniques for Differential Expression Analysis on RNA-Seq Data |
title_sort | comparative study of techniques for differential expression analysis on rna-seq data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4132098/ https://www.ncbi.nlm.nih.gov/pubmed/25119138 http://dx.doi.org/10.1371/journal.pone.0103207 |
work_keys_str_mv | AT zhangzonghong acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT jhaveridhanishaj acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT marshallvikkim acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT bauerdenisc acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT edsonjanette acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT narayananrameshk acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT robinsongregoryj acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT lundbergandrease acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT bartlettperryf acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT wraynaomir acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT zhaoqiongyi acomparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT zhangzonghong comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT jhaveridhanishaj comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT marshallvikkim comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT bauerdenisc comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT edsonjanette comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT narayananrameshk comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT robinsongregoryj comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT lundbergandrease comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT bartlettperryf comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT wraynaomir comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata AT zhaoqiongyi comparativestudyoftechniquesfordifferentialexpressionanalysisonrnaseqdata |