Cargando…

DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data

RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were deve...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Zhihua, Zhao, Zhiying, Tang, Wenqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284200/
https://www.ncbi.nlm.nih.gov/pubmed/30559761
http://dx.doi.org/10.3389/fgene.2018.00588
_version_ 1783379295817695232
author Gao, Zhihua
Zhao, Zhiying
Tang, Wenqiang
author_facet Gao, Zhihua
Zhao, Zhiying
Tang, Wenqiang
author_sort Gao, Zhihua
collection PubMed
description RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data also displays underdispersion characteristics that cannot be adequately captured by general RNA-seq analysis methods. Based on a double Poisson model capable of capturing all data characteristics, we developed a new RNA-seq analysis method (DREAMSeq). Comparison of DREAMSeq with five other frequently used RNA-seq analysis methods using simulated datasets showed that its performance was comparable to or exceeded that of other methods in terms of type I error rate, statistical power, receiver operating characteristics (ROC) curve, area under the ROC curve, precision-recall curve, and the ability to detect the number of differentially expressed genes, especially in situations involving underdispersion. These results were validated by quantitative real-time polymerase chain reaction using a real Foxtail dataset. Our findings demonstrated DREAMSeq as a reliable, robust, and powerful new method for RNA-seq data mining. The DREAMSeq R package is available at http://tanglab.hebtu.edu.cn/tanglab/Home/DREAMSeq.
format Online
Article
Text
id pubmed-6284200
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-62842002018-12-17 DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data Gao, Zhihua Zhao, Zhiying Tang, Wenqiang Front Genet Genetics RNA sequencing (RNA-seq) has become a widely used technology for analyzing global gene-expression changes during certain biological processes. It is generally acknowledged that RNA-seq data displays equidispersion and overdispersion characteristics; therefore, most RNA-seq analysis methods were developed based on a negative binomial model capable of capturing both equidispersed and overdispersed data. In this study, we reported that in addition to equidispersion and overdispersion, RNA-seq data also displays underdispersion characteristics that cannot be adequately captured by general RNA-seq analysis methods. Based on a double Poisson model capable of capturing all data characteristics, we developed a new RNA-seq analysis method (DREAMSeq). Comparison of DREAMSeq with five other frequently used RNA-seq analysis methods using simulated datasets showed that its performance was comparable to or exceeded that of other methods in terms of type I error rate, statistical power, receiver operating characteristics (ROC) curve, area under the ROC curve, precision-recall curve, and the ability to detect the number of differentially expressed genes, especially in situations involving underdispersion. These results were validated by quantitative real-time polymerase chain reaction using a real Foxtail dataset. Our findings demonstrated DREAMSeq as a reliable, robust, and powerful new method for RNA-seq data mining. The DREAMSeq R package is available at http://tanglab.hebtu.edu.cn/tanglab/Home/DREAMSeq. Frontiers Media S.A. 2018-11-30 /pmc/articles/PMC6284200/ /pubmed/30559761 http://dx.doi.org/10.3389/fgene.2018.00588 Text en Copyright © 2018 Gao, Zhao and Tang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Gao, Zhihua
Zhao, Zhiying
Tang, Wenqiang
DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title_full DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title_fullStr DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title_full_unstemmed DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title_short DREAMSeq: An Improved Method for Analyzing Differentially Expressed Genes in RNA-seq Data
title_sort dreamseq: an improved method for analyzing differentially expressed genes in rna-seq data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284200/
https://www.ncbi.nlm.nih.gov/pubmed/30559761
http://dx.doi.org/10.3389/fgene.2018.00588
work_keys_str_mv AT gaozhihua dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata
AT zhaozhiying dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata
AT tangwenqiang dreamseqanimprovedmethodforanalyzingdifferentiallyexpressedgenesinrnaseqdata