Cargando…

Independent component analysis reveals new and biologically significant structures in micro array data

BACKGROUND: An alternative to standard approaches to uncover biologically meaningful structures in micro array data is to treat the data as a blind source separation (BSS) problem. BSS attempts to separate a mixture of signals into their different sources and refers to the problem of recovering sign...

Descripción completa

Detalles Bibliográficos
Autores principales: Frigyesi, Attila, Veerla, Srinivas, Lindgren, David, Höglund, Mattias
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557674/
https://www.ncbi.nlm.nih.gov/pubmed/16762055
http://dx.doi.org/10.1186/1471-2105-7-290
_version_ 1782129393623105536
author Frigyesi, Attila
Veerla, Srinivas
Lindgren, David
Höglund, Mattias
author_facet Frigyesi, Attila
Veerla, Srinivas
Lindgren, David
Höglund, Mattias
author_sort Frigyesi, Attila
collection PubMed
description BACKGROUND: An alternative to standard approaches to uncover biologically meaningful structures in micro array data is to treat the data as a blind source separation (BSS) problem. BSS attempts to separate a mixture of signals into their different sources and refers to the problem of recovering signals from several observed linear mixtures. In the context of micro array data, "sources" may correspond to specific cellular responses or to co-regulated genes. RESULTS: We applied independent component analysis (ICA) to three different microarray data sets; two tumor data sets and one time series experiment. To obtain reliable components we used iterated ICA to estimate component centrotypes. We found that many of the low ranking components indeed may show a strong biological coherence and hence be of biological significance. Generally ICA achieved a higher resolution when compared with results based on correlated expression and a larger number of gene clusters with significantly enriched for gene ontology (GO) categories. In addition, components characteristic for molecular subtypes and for tumors with specific chromosomal translocations were identified. ICA also identified more than one gene clusters significant for the same GO categories and hence disclosed a higher level of biological heterogeneity, even within coherent groups of genes. CONCLUSION: Although the ICA approach primarily detects hidden variables, these surfaced as highly correlated genes in time series data and in one instance in the tumor data. This further strengthens the biological relevance of latent variables detected by ICA.
format Text
id pubmed-1557674
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15576742006-08-31 Independent component analysis reveals new and biologically significant structures in micro array data Frigyesi, Attila Veerla, Srinivas Lindgren, David Höglund, Mattias BMC Bioinformatics Methodology Article BACKGROUND: An alternative to standard approaches to uncover biologically meaningful structures in micro array data is to treat the data as a blind source separation (BSS) problem. BSS attempts to separate a mixture of signals into their different sources and refers to the problem of recovering signals from several observed linear mixtures. In the context of micro array data, "sources" may correspond to specific cellular responses or to co-regulated genes. RESULTS: We applied independent component analysis (ICA) to three different microarray data sets; two tumor data sets and one time series experiment. To obtain reliable components we used iterated ICA to estimate component centrotypes. We found that many of the low ranking components indeed may show a strong biological coherence and hence be of biological significance. Generally ICA achieved a higher resolution when compared with results based on correlated expression and a larger number of gene clusters with significantly enriched for gene ontology (GO) categories. In addition, components characteristic for molecular subtypes and for tumors with specific chromosomal translocations were identified. ICA also identified more than one gene clusters significant for the same GO categories and hence disclosed a higher level of biological heterogeneity, even within coherent groups of genes. CONCLUSION: Although the ICA approach primarily detects hidden variables, these surfaced as highly correlated genes in time series data and in one instance in the tumor data. This further strengthens the biological relevance of latent variables detected by ICA. BioMed Central 2006-06-08 /pmc/articles/PMC1557674/ /pubmed/16762055 http://dx.doi.org/10.1186/1471-2105-7-290 Text en Copyright © 2006 Frigyesi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Frigyesi, Attila
Veerla, Srinivas
Lindgren, David
Höglund, Mattias
Independent component analysis reveals new and biologically significant structures in micro array data
title Independent component analysis reveals new and biologically significant structures in micro array data
title_full Independent component analysis reveals new and biologically significant structures in micro array data
title_fullStr Independent component analysis reveals new and biologically significant structures in micro array data
title_full_unstemmed Independent component analysis reveals new and biologically significant structures in micro array data
title_short Independent component analysis reveals new and biologically significant structures in micro array data
title_sort independent component analysis reveals new and biologically significant structures in micro array data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1557674/
https://www.ncbi.nlm.nih.gov/pubmed/16762055
http://dx.doi.org/10.1186/1471-2105-7-290
work_keys_str_mv AT frigyesiattila independentcomponentanalysisrevealsnewandbiologicallysignificantstructuresinmicroarraydata
AT veerlasrinivas independentcomponentanalysisrevealsnewandbiologicallysignificantstructuresinmicroarraydata
AT lindgrendavid independentcomponentanalysisrevealsnewandbiologicallysignificantstructuresinmicroarraydata
AT hoglundmattias independentcomponentanalysisrevealsnewandbiologicallysignificantstructuresinmicroarraydata