Cargando…

An iteration normalization and test method for differential expression analysis of RNA-seq data

BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Yan, Lin, Nan, Zhang, Baoxue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181730/
https://www.ncbi.nlm.nih.gov/pubmed/25285156
http://dx.doi.org/10.1186/1756-0381-7-15
_version_ 1782337408621084672
author Zhou, Yan
Lin, Nan
Zhang, Baoxue
author_facet Zhou, Yan
Lin, Nan
Zhang, Baoxue
author_sort Zhou, Yan
collection PubMed
description BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. RESULTS: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What’s more, we discovered that gene UNC5C is highly associated with kidney cancer and so on.
format Online
Article
Text
id pubmed-4181730
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41817302014-10-03 An iteration normalization and test method for differential expression analysis of RNA-seq data Zhou, Yan Lin, Nan Zhang, Baoxue BioData Min Methodology BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. RESULTS: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What’s more, we discovered that gene UNC5C is highly associated with kidney cancer and so on. BioMed Central 2014-08-13 /pmc/articles/PMC4181730/ /pubmed/25285156 http://dx.doi.org/10.1186/1756-0381-7-15 Text en Copyright © 2014 Zhou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Zhou, Yan
Lin, Nan
Zhang, Baoxue
An iteration normalization and test method for differential expression analysis of RNA-seq data
title An iteration normalization and test method for differential expression analysis of RNA-seq data
title_full An iteration normalization and test method for differential expression analysis of RNA-seq data
title_fullStr An iteration normalization and test method for differential expression analysis of RNA-seq data
title_full_unstemmed An iteration normalization and test method for differential expression analysis of RNA-seq data
title_short An iteration normalization and test method for differential expression analysis of RNA-seq data
title_sort iteration normalization and test method for differential expression analysis of rna-seq data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181730/
https://www.ncbi.nlm.nih.gov/pubmed/25285156
http://dx.doi.org/10.1186/1756-0381-7-15
work_keys_str_mv AT zhouyan aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata
AT linnan aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata
AT zhangbaoxue aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata
AT zhouyan iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata
AT linnan iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata
AT zhangbaoxue iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata