Cargando…
DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were deve...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284200/ https://www.ncbi.nlm.nih.gov/pubmed/30559761 http://dx.doi.org/10.3389/fgene.2018.00588 |
_version_ | 1783379295817695232 |
---|---|
author | Gao, Zhihua Zhao, Zhiying Tang, Wenqiang |
author_facet | Gao, Zhihua Zhao, Zhiying Tang, Wenqiang |
author_sort | Gao, Zhihua |
collection | PubMed |
description | RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data also displays underdispersion characteristics that cannot be adequately captured by general RNA-seq analysis methods. Based on a double Poisson model capable of capturing all data characteristics, we developed a new RNA-seq analysis method (DREAMSeq). Comparison of DREAMSeq with five other frequently used RNA-seq analysis methods using simulated datasets showed that its performance was comparable to or exceeded that of other methods in terms of type I error rate, statistical power, receiver operating characteristics (ROC) curve, area under the ROC curve, precision-recall curve, and the ability to detect the number of differentially expressed genes, especially in situations involving underdispersion. These results were validated by quantitative real-time polymerase chain reaction using a real Foxtail dataset. Our findings demonstrated DREAMSeq as a reliable, robust, and powerful new method for RNA-seq data mining. The DREAMSeq R package is available at http://tanglab.hebtu.edu.cn/tanglab/Home/DREAMSeq. |
format | Online Article Text |
id | pubmed-6284200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-62842002018-12-17 DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data Gao, Zhihua Zhao, Zhiying Tang, Wenqiang Front Genet Genetics RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data also displays underdispersion characteristics that cannot be adequately captured by general RNA-seq analysis methods. Based on a double Poisson model capable of capturing all data characteristics, we developed a new RNA-seq analysis method (DREAMSeq). Comparison of DREAMSeq with five other frequently used RNA-seq analysis methods using simulated datasets showed that its performance was comparable to or exceeded that of other methods in terms of type I error rate, statistical power, receiver operating characteristics (ROC) curve, area under the ROC curve, precision-recall curve, and the ability to detect the number of differentially expressed genes, especially in situations involving underdispersion. These results were validated by quantitative real-time polymerase chain reaction using a real Foxtail dataset. Our findings demonstrated DREAMSeq as a reliable, robust, and powerful new method for RNA-seq data mining. The DREAMSeq R package is available at http://tanglab.hebtu.edu.cn/tanglab/Home/DREAMSeq. Frontiers Media S.A. 2018-11-30 /pmc/articles/PMC6284200/ /pubmed/30559761 http://dx.doi.org/10.3389/fgene.2018.00588 Text en Copyright © 2018 Gao, Zhao and Tang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Gao, Zhihua Zhao, Zhiying Tang, Wenqiang DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title | DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title_full | DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title_fullStr | DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title_full_unstemmed | DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title_short | DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data |
title_sort | dreamseq: an improved method for analyzing differentially expressed genes in rna-seq data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284200/ https://www.ncbi.nlm.nih.gov/pubmed/30559761 http://dx.doi.org/10.3389/fgene.2018.00588 |
work_keys_str_mv | AT gaozhihua dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata AT zhaozhiying dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata AT tangwenqiang dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata |