Cargando…

An Integrated Approach for RNA-seq Data Normalization

BACKGROUND: DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis o...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Shengping, Mercante, Donald E., Zhang, Kun, Fang, Zhide
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4924883/
https://www.ncbi.nlm.nih.gov/pubmed/27385909
http://dx.doi.org/10.4137/CIN.S39781
_version_ 1782439943969177600
author Yang, Shengping
Mercante, Donald E.
Zhang, Kun
Fang, Zhide
author_facet Yang, Shengping
Mercante, Donald E.
Zhang, Kun
Fang, Zhide
author_sort Yang, Shengping
collection PubMed
description BACKGROUND: DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. RESULTS: In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. CONCLUSIONS: Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling.
format Online
Article
Text
id pubmed-4924883
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-49248832016-07-06 An Integrated Approach for RNA-seq Data Normalization Yang, Shengping Mercante, Donald E. Zhang, Kun Fang, Zhide Cancer Inform Original Research BACKGROUND: DNA copy number alteration is common in many cancers. Studies have shown that insertion or deletion of DNA sequences can directly alter gene expression, and significant correlation exists between DNA copy number and gene expression. Data normalization is a critical step in the analysis of gene expression generated by RNA-seq technology. Successful normalization reduces/removes unwanted nonbiological variations in the data, while keeping meaningful information intact. However, as far as we know, no attempt has been made to adjust for the variation due to DNA copy number changes in RNA-seq data normalization. RESULTS: In this article, we propose an integrated approach for RNA-seq data normalization. Comparisons show that the proposed normalization can improve power for downstream differentially expressed gene detection and generate more biologically meaningful results in gene profiling. In addition, our findings show that due to the effects of copy number changes, some housekeeping genes are not always suitable internal controls for studying gene expression. CONCLUSIONS: Using information from DNA copy number, integrated approach is successful in reducing noises due to both biological and nonbiological causes in RNA-seq data, thus increasing the accuracy of gene profiling. Libertas Academica 2016-06-27 /pmc/articles/PMC4924883/ /pubmed/27385909 http://dx.doi.org/10.4137/CIN.S39781 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license.
spellingShingle Original Research
Yang, Shengping
Mercante, Donald E.
Zhang, Kun
Fang, Zhide
An Integrated Approach for RNA-seq Data Normalization
title An Integrated Approach for RNA-seq Data Normalization
title_full An Integrated Approach for RNA-seq Data Normalization
title_fullStr An Integrated Approach for RNA-seq Data Normalization
title_full_unstemmed An Integrated Approach for RNA-seq Data Normalization
title_short An Integrated Approach for RNA-seq Data Normalization
title_sort integrated approach for rna-seq data normalization
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4924883/
https://www.ncbi.nlm.nih.gov/pubmed/27385909
http://dx.doi.org/10.4137/CIN.S39781
work_keys_str_mv AT yangshengping anintegratedapproachforrnaseqdatanormalization
AT mercantedonalde anintegratedapproachforrnaseqdatanormalization
AT zhangkun anintegratedapproachforrnaseqdatanormalization
AT fangzhide anintegratedapproachforrnaseqdatanormalization
AT yangshengping integratedapproachforrnaseqdatanormalization
AT mercantedonalde integratedapproachforrnaseqdatanormalization
AT zhangkun integratedapproachforrnaseqdatanormalization
AT fangzhide integratedapproachforrnaseqdatanormalization