Cargando…
Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experiment...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538581/ https://www.ncbi.nlm.nih.gov/pubmed/26339642 http://dx.doi.org/10.1155/2015/789516 |
_version_ | 1782386010456326144 |
---|---|
author | Dou, Yongchao Guo, Xiaomei Yuan, Lingling Holding, David R. Zhang, Chi |
author_facet | Dou, Yongchao Guo, Xiaomei Yuan, Lingling Holding, David R. Zhang, Chi |
author_sort | Dou, Yongchao |
collection | PubMed |
description | To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experimental designs with computational results. In this study, a new tool for the identification of differentially expressed genes with RNA-seq data, named GExposer, was developed. This tool introduces a local normalization algorithm to reduce the bias of nonrandomly positioned read depth. The naive Bayes classifier is employed to integrate fold change, transcript length, and GC content to identify differentially expressed genes. Results on several independent tests show that GExposer has better performance than other methods. The combination of the local normalization algorithm and naive Bayes classifier with three attributes can achieve better results; both false positive rates and false negative rates are reduced. However, only a small portion of genes is affected by the local normalization and GC content correction. |
format | Online Article Text |
id | pubmed-4538581 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-45385812015-09-03 Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization Dou, Yongchao Guo, Xiaomei Yuan, Lingling Holding, David R. Zhang, Chi Biomed Res Int Research Article To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experimental designs with computational results. In this study, a new tool for the identification of differentially expressed genes with RNA-seq data, named GExposer, was developed. This tool introduces a local normalization algorithm to reduce the bias of nonrandomly positioned read depth. The naive Bayes classifier is employed to integrate fold change, transcript length, and GC content to identify differentially expressed genes. Results on several independent tests show that GExposer has better performance than other methods. The combination of the local normalization algorithm and naive Bayes classifier with three attributes can achieve better results; both false positive rates and false negative rates are reduced. However, only a small portion of genes is affected by the local normalization and GC content correction. Hindawi Publishing Corporation 2015 2015-08-03 /pmc/articles/PMC4538581/ /pubmed/26339642 http://dx.doi.org/10.1155/2015/789516 Text en Copyright © 2015 Yongchao Dou et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dou, Yongchao Guo, Xiaomei Yuan, Lingling Holding, David R. Zhang, Chi Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title | Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title_full | Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title_fullStr | Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title_full_unstemmed | Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title_short | Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization |
title_sort | differential expression analysis in rna-seq by a naive bayes classifier with local normalization |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538581/ https://www.ncbi.nlm.nih.gov/pubmed/26339642 http://dx.doi.org/10.1155/2015/789516 |
work_keys_str_mv | AT douyongchao differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization AT guoxiaomei differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization AT yuanlingling differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization AT holdingdavidr differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization AT zhangchi differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization |