Cargando…

Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks

BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the canonica...

Descripción completa

Detalles Bibliográficos
Autores principales: Waaijenborg, Sandra, Zwinderman, Aeilko H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2760886/
https://www.ncbi.nlm.nih.gov/pubmed/19785734
http://dx.doi.org/10.1186/1471-2105-10-315
_version_ 1782172795079229440
author Waaijenborg, Sandra
Zwinderman, Aeilko H
author_facet Waaijenborg, Sandra
Zwinderman, Aeilko H
author_sort Waaijenborg, Sandra
collection PubMed
description BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the canonical variates, and we applied ridge penalization to the regression of pathway genes on canonical variates of the non-pathway genes, and the elastic net to the regression of non-pathway genes on the canonical variates of the pathway genes. RESULTS: We performed a small simulation to illustrate the model's capability to identify new candidate genes to incorporate in the pathway: in our simulations it appeared that a gene was correctly identified if the correlation with the pathway genes was 0.3 or more. We applied the methods to a gene-expression microarray data set of 12, 209 genes measured in 45 patients with glioblastoma, and we considered genes to incorporate in the glioma-pathway: we identified more than 25 genes that correlated > 0.9 with canonical variates of the pathway genes. CONCLUSION: We concluded that penalized canonical correlation analysis is a powerful tool to identify candidate genes in pathway analysis.
format Text
id pubmed-2760886
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27608862009-10-13 Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks Waaijenborg, Sandra Zwinderman, Aeilko H BMC Bioinformatics Methodology Article BACKGROUND: We generalized penalized canonical correlation analysis for analyzing microarray gene-expression measurements for checking completeness of known metabolic pathways and identifying candidate genes for incorporation in the pathway. We used Wold's method for calculation of the canonical variates, and we applied ridge penalization to the regression of pathway genes on canonical variates of the non-pathway genes, and the elastic net to the regression of non-pathway genes on the canonical variates of the pathway genes. RESULTS: We performed a small simulation to illustrate the model's capability to identify new candidate genes to incorporate in the pathway: in our simulations it appeared that a gene was correctly identified if the correlation with the pathway genes was 0.3 or more. We applied the methods to a gene-expression microarray data set of 12, 209 genes measured in 45 patients with glioblastoma, and we considered genes to incorporate in the glioma-pathway: we identified more than 25 genes that correlated > 0.9 with canonical variates of the pathway genes. CONCLUSION: We concluded that penalized canonical correlation analysis is a powerful tool to identify candidate genes in pathway analysis. BioMed Central 2009-09-28 /pmc/articles/PMC2760886/ /pubmed/19785734 http://dx.doi.org/10.1186/1471-2105-10-315 Text en Copyright © 2009 Waaijenborg and Zwinderman; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Waaijenborg, Sandra
Zwinderman, Aeilko H
Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title_full Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title_fullStr Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title_full_unstemmed Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title_short Sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
title_sort sparse canonical correlation analysis for identifying, connecting and completing gene-expression networks
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2760886/
https://www.ncbi.nlm.nih.gov/pubmed/19785734
http://dx.doi.org/10.1186/1471-2105-10-315
work_keys_str_mv AT waaijenborgsandra sparsecanonicalcorrelationanalysisforidentifyingconnectingandcompletinggeneexpressionnetworks
AT zwindermanaeilkoh sparsecanonicalcorrelationanalysisforidentifyingconnectingandcompletinggeneexpressionnetworks