Cargando…

A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data

Next-generation sequencing technologies have made RNA sequencing (RNA-seq) a popular choice for measuring gene expression level. To reduce the noise of gene expression measures and compare them between several conditions or samples, normalization is an essential step to adjust for varying sample seq...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yan, Wang, Guochang, Zhang, Jun, Li, Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5224994/
https://www.ncbi.nlm.nih.gov/pubmed/28072846
http://dx.doi.org/10.1371/journal.pone.0169594
_version_ 1782493444906680320
author Zhou, Yan
Wang, Guochang
Zhang, Jun
Li, Han
author_facet Zhou, Yan
Wang, Guochang
Zhang, Jun
Li, Han
author_sort Zhou, Yan
collection PubMed
description Next-generation sequencing technologies have made RNA sequencing (RNA-seq) a popular choice for measuring gene expression level. To reduce the noise of gene expression measures and compare them between several conditions or samples, normalization is an essential step to adjust for varying sample sequencing depths and other unwanted technical effects. In this paper, we develop a novel global scaling normalization method by employing the available knowledge of housekeeping genes. We formulate the problem from the hypothesis testing perspective and find an optimal scaling factor that minimizes the deviation between the empirical and the nominal type I error. Applying our approach to various simulation studies and real examples, we demonstrate that it is more accurate and robust than the state-of-the-art alternatives in detecting differentially expression genes.
format Online
Article
Text
id pubmed-5224994
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52249942017-01-31 A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data Zhou, Yan Wang, Guochang Zhang, Jun Li, Han PLoS One Research Article Next-generation sequencing technologies have made RNA sequencing (RNA-seq) a popular choice for measuring gene expression level. To reduce the noise of gene expression measures and compare them between several conditions or samples, normalization is an essential step to adjust for varying sample sequencing depths and other unwanted technical effects. In this paper, we develop a novel global scaling normalization method by employing the available knowledge of housekeeping genes. We formulate the problem from the hypothesis testing perspective and find an optimal scaling factor that minimizes the deviation between the empirical and the nominal type I error. Applying our approach to various simulation studies and real examples, we demonstrate that it is more accurate and robust than the state-of-the-art alternatives in detecting differentially expression genes. Public Library of Science 2017-01-10 /pmc/articles/PMC5224994/ /pubmed/28072846 http://dx.doi.org/10.1371/journal.pone.0169594 Text en © 2017 Zhou et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zhou, Yan
Wang, Guochang
Zhang, Jun
Li, Han
A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title_full A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title_fullStr A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title_full_unstemmed A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title_short A Hypothesis Testing Based Method for Normalization and Differential Expression Analysis of RNA-Seq Data
title_sort hypothesis testing based method for normalization and differential expression analysis of rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5224994/
https://www.ncbi.nlm.nih.gov/pubmed/28072846
http://dx.doi.org/10.1371/journal.pone.0169594
work_keys_str_mv AT zhouyan ahypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT wangguochang ahypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT zhangjun ahypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT lihan ahypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT zhouyan hypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT wangguochang hypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT zhangjun hypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata
AT lihan hypothesistestingbasedmethodfornormalizationanddifferentialexpressionanalysisofrnaseqdata