Cargando…
An iteration normalization and test method for differential expression analysis of RNA-seq data
BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181730/ https://www.ncbi.nlm.nih.gov/pubmed/25285156 http://dx.doi.org/10.1186/1756-0381-7-15 |
_version_ | 1782337408621084672 |
---|---|
author | Zhou, Yan Lin, Nan Zhang, Baoxue |
author_facet | Zhou, Yan Lin, Nan Zhang, Baoxue |
author_sort | Zhou, Yan |
collection | PubMed |
description | BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. RESULTS: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What’s more, we discovered that gene UNC5C is highly associated with kidney cancer and so on. |
format | Online Article Text |
id | pubmed-4181730 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41817302014-10-03 An iteration normalization and test method for differential expression analysis of RNA-seq data Zhou, Yan Lin, Nan Zhang, Baoxue BioData Min Methodology BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. RESULTS: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What’s more, we discovered that gene UNC5C is highly associated with kidney cancer and so on. BioMed Central 2014-08-13 /pmc/articles/PMC4181730/ /pubmed/25285156 http://dx.doi.org/10.1186/1756-0381-7-15 Text en Copyright © 2014 Zhou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Zhou, Yan Lin, Nan Zhang, Baoxue An iteration normalization and test method for differential expression analysis of RNA-seq data |
title | An iteration normalization and test method for differential expression analysis of RNA-seq data |
title_full | An iteration normalization and test method for differential expression analysis of RNA-seq data |
title_fullStr | An iteration normalization and test method for differential expression analysis of RNA-seq data |
title_full_unstemmed | An iteration normalization and test method for differential expression analysis of RNA-seq data |
title_short | An iteration normalization and test method for differential expression analysis of RNA-seq data |
title_sort | iteration normalization and test method for differential expression analysis of rna-seq data |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4181730/ https://www.ncbi.nlm.nih.gov/pubmed/25285156 http://dx.doi.org/10.1186/1756-0381-7-15 |
work_keys_str_mv | AT zhouyan aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata AT linnan aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata AT zhangbaoxue aniterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata AT zhouyan iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata AT linnan iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata AT zhangbaoxue iterationnormalizationandtestmethodfordifferentialexpressionanalysisofrnaseqdata |