Cargando…
De-correlating expression in gene-set analysis
Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most c...
Autor principal: | |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935420/ https://www.ncbi.nlm.nih.gov/pubmed/20823315 http://dx.doi.org/10.1093/bioinformatics/btq380 |
_version_ | 1782186398860705792 |
---|---|
author | Nam, Dougu |
author_facet | Nam, Dougu |
author_sort | Nam, Dougu |
collection | PubMed |
description | Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most cases, however, GSA relies on the invalid assumption that the members of each gene set are sampled independently, which increases false predictions. Results: We propose an algorithm, termed DECO, to remove (or alleviate) the bias caused by the correlation of the expression data in GSAs. This is accomplished through the eigenvalue-decomposition of covariance matrixes and a series of linear transformations of data. In particular, moderate de-correlation methods that truncate or re-scale eigenvalues were proposed for a more reliable analysis. Tests of simulated and real experimental data show that DECO effectively corrects the correlation structure of gene expression and improves the prediction accuracy (specificity and sensitivity) for both gene- and sample-randomizing GSA methods. Availability: The MATLAB codes and the tested data sets are available at ftp://deco.nims.re.kr/pub or from the author. Contact: dougnam@unist.ac.kr |
format | Text |
id | pubmed-2935420 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-29354202010-09-08 De-correlating expression in gene-set analysis Nam, Dougu Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: Group-wise pattern analysis of genes, known as gene-set analysis (GSA), addresses the differential expression pattern of biologically pre-defined gene sets. GSA exhibits high statistical power and has revealed many novel biological processes associated with specific phenotypes. In most cases, however, GSA relies on the invalid assumption that the members of each gene set are sampled independently, which increases false predictions. Results: We propose an algorithm, termed DECO, to remove (or alleviate) the bias caused by the correlation of the expression data in GSAs. This is accomplished through the eigenvalue-decomposition of covariance matrixes and a series of linear transformations of data. In particular, moderate de-correlation methods that truncate or re-scale eigenvalues were proposed for a more reliable analysis. Tests of simulated and real experimental data show that DECO effectively corrects the correlation structure of gene expression and improves the prediction accuracy (specificity and sensitivity) for both gene- and sample-randomizing GSA methods. Availability: The MATLAB codes and the tested data sets are available at ftp://deco.nims.re.kr/pub or from the author. Contact: dougnam@unist.ac.kr Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935420/ /pubmed/20823315 http://dx.doi.org/10.1093/bioinformatics/btq380 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Nam, Dougu De-correlating expression in gene-set analysis |
title | De-correlating expression in gene-set analysis |
title_full | De-correlating expression in gene-set analysis |
title_fullStr | De-correlating expression in gene-set analysis |
title_full_unstemmed | De-correlating expression in gene-set analysis |
title_short | De-correlating expression in gene-set analysis |
title_sort | de-correlating expression in gene-set analysis |
topic | Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935420/ https://www.ncbi.nlm.nih.gov/pubmed/20823315 http://dx.doi.org/10.1093/bioinformatics/btq380 |
work_keys_str_mv | AT namdougu decorrelatingexpressioningenesetanalysis |