Cargando…

Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization

To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experiment...

Descripción completa

Detalles Bibliográficos
Autores principales: Dou, Yongchao, Guo, Xiaomei, Yuan, Lingling, Holding, David R., Zhang, Chi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538581/
https://www.ncbi.nlm.nih.gov/pubmed/26339642
http://dx.doi.org/10.1155/2015/789516
_version_ 1782386010456326144
author Dou, Yongchao
Guo, Xiaomei
Yuan, Lingling
Holding, David R.
Zhang, Chi
author_facet Dou, Yongchao
Guo, Xiaomei
Yuan, Lingling
Holding, David R.
Zhang, Chi
author_sort Dou, Yongchao
collection PubMed
description To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experimental designs with computational results. In this study, a new tool for the identification of differentially expressed genes with RNA-seq data, named GExposer, was developed. This tool introduces a local normalization algorithm to reduce the bias of nonrandomly positioned read depth. The naive Bayes classifier is employed to integrate fold change, transcript length, and GC content to identify differentially expressed genes. Results on several independent tests show that GExposer has better performance than other methods. The combination of the local normalization algorithm and naive Bayes classifier with three attributes can achieve better results; both false positive rates and false negative rates are reduced. However, only a small portion of genes is affected by the local normalization and GC content correction.
format Online
Article
Text
id pubmed-4538581
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-45385812015-09-03 Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization Dou, Yongchao Guo, Xiaomei Yuan, Lingling Holding, David R. Zhang, Chi Biomed Res Int Research Article To improve the applicability of RNA-seq technology, a large number of RNA-seq data analysis methods and correction algorithms have been developed. Although these new methods and algorithms have steadily improved transcriptome analysis, greater prediction accuracy is needed to better guide experimental designs with computational results. In this study, a new tool for the identification of differentially expressed genes with RNA-seq data, named GExposer, was developed. This tool introduces a local normalization algorithm to reduce the bias of nonrandomly positioned read depth. The naive Bayes classifier is employed to integrate fold change, transcript length, and GC content to identify differentially expressed genes. Results on several independent tests show that GExposer has better performance than other methods. The combination of the local normalization algorithm and naive Bayes classifier with three attributes can achieve better results; both false positive rates and false negative rates are reduced. However, only a small portion of genes is affected by the local normalization and GC content correction. Hindawi Publishing Corporation 2015 2015-08-03 /pmc/articles/PMC4538581/ /pubmed/26339642 http://dx.doi.org/10.1155/2015/789516 Text en Copyright © 2015 Yongchao Dou et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dou, Yongchao
Guo, Xiaomei
Yuan, Lingling
Holding, David R.
Zhang, Chi
Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title_full Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title_fullStr Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title_full_unstemmed Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title_short Differential Expression Analysis in RNA-Seq by a Naive Bayes Classifier with Local Normalization
title_sort differential expression analysis in rna-seq by a naive bayes classifier with local normalization
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538581/
https://www.ncbi.nlm.nih.gov/pubmed/26339642
http://dx.doi.org/10.1155/2015/789516
work_keys_str_mv AT douyongchao differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization
AT guoxiaomei differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization
AT yuanlingling differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization
AT holdingdavidr differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization
AT zhangchi differentialexpressionanalysisinrnaseqbyanaivebayesclassifierwithlocalnormalization