Cargando…

De-correlating expression in gene-set analysis

Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most c...

Descripción completa

Detalles Bibliográficos
Autor principal: Nam, Dougu
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935420/
https://www.ncbi.nlm.nih.gov/pubmed/20823315
http://dx.doi.org/10.1093/bioinformatics/btq380
_version_ 1782186398860705792
author Nam, Dougu
author_facet Nam, Dougu
author_sort Nam, Dougu
collection PubMed
description Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most cases, however, GSA relies on the invalid assumption that the members of each gene set are sampled independently, which increases false predictions. Results: We propose an algorithm, termed DECO, to remove (or alleviate) the bias caused by the correlation of the expression data in GSAs. This is accomplished through the eigenvalue-decomposition of covariance matrixes and a series of linear transformations of data. In particular, moderate de-correlation methods that truncate or re-scale eigenvalues were proposed for a more reliable analysis. Tests of simulated and real experimental data show that DECO effectively corrects the correlation structure of gene expression and improves the prediction accuracy (specificity and sensitivity) for both gene- and sample-randomizing GSA methods. Availability: The MATLAB codes and the tested data sets are available at ftp://deco.nims.re.kr/pub or from the author. Contact: dougnam@unist.ac.kr
format Text
id pubmed-2935420
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29354202010-09-08 De-correlating expression in gene-set analysis Nam, Dougu Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most cases, however, GSA relies on the invalid assumption that the members of each gene set are sampled independently, which increases false predictions. Results: We propose an algorithm, termed DECO, to remove (or alleviate) the bias caused by the correlation of the expression data in GSAs. This is accomplished through the eigenvalue-decomposition of covariance matrixes and a series of linear transformations of data. In particular, moderate de-correlation methods that truncate or re-scale eigenvalues were proposed for a more reliable analysis. Tests of simulated and real experimental data show that DECO effectively corrects the correlation structure of gene expression and improves the prediction accuracy (specificity and sensitivity) for both gene- and sample-randomizing GSA methods. Availability: The MATLAB codes and the tested data sets are available at ftp://deco.nims.re.kr/pub or from the author. Contact: dougnam@unist.ac.kr Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935420/ /pubmed/20823315 http://dx.doi.org/10.1093/bioinformatics/btq380 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
Nam, Dougu
De-correlating expression in gene-set analysis
title De-correlating expression in gene-set analysis
title_full De-correlating expression in gene-set analysis
title_fullStr De-correlating expression in gene-set analysis
title_full_unstemmed De-correlating expression in gene-set analysis
title_short De-correlating expression in gene-set analysis
title_sort de-correlating expression in gene-set analysis
topic Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935420/
https://www.ncbi.nlm.nih.gov/pubmed/20823315
http://dx.doi.org/10.1093/bioinformatics/btq380
work_keys_str_mv AT namdougu decorrelatingexpressioningenesetanalysis