Cargando…

How Many Separable Sources? Model Selection In Independent Components Analysis

Unlike mixtures consisting solely of non-Gaussian sources, mixtures including two or more Gaussian components cannot be separated using standard independent components analysis methods that are based on higher order statistics and independent observations. The mixed Independent Components Analysis/P...

Descripción completa

Detalles Bibliográficos
Autores principales: Woods, Roger P., Hansen, Lars Kai, Strother, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374758/
https://www.ncbi.nlm.nih.gov/pubmed/25811988
http://dx.doi.org/10.1371/journal.pone.0118877
_version_ 1782363539865862144
author Woods, Roger P.
Hansen, Lars Kai
Strother, Stephen
author_facet Woods, Roger P.
Hansen, Lars Kai
Strother, Stephen
author_sort Woods, Roger P.
collection PubMed
description Unlike mixtures consisting solely of non-Gaussian sources, mixtures including two or more Gaussian components cannot be separated using standard independent components analysis methods that are based on higher order statistics and independent observations. The mixed Independent Components Analysis/Principal Components Analysis (mixed ICA/PCA) model described here accommodates one or more Gaussian components in the independent components analysis model and uses principal components analysis to characterize contributions from this inseparable Gaussian subspace. Information theory can then be used to select from among potential model categories with differing numbers of Gaussian components. Based on simulation studies, the assumptions and approximations underlying the Akaike Information Criterion do not hold in this setting, even with a very large number of observations. Cross-validation is a suitable, though computationally intensive alternative for model selection. Application of the algorithm is illustrated using Fisher's iris data set and Howells' craniometric data set. Mixed ICA/PCA is of potential interest in any field of scientific investigation where the authenticity of blindly separated non-Gaussian sources might otherwise be questionable. Failure of the Akaike Information Criterion in model selection also has relevance in traditional independent components analysis where all sources are assumed non-Gaussian.
format Online
Article
Text
id pubmed-4374758
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43747582015-04-04 How Many Separable Sources? Model Selection In Independent Components Analysis Woods, Roger P. Hansen, Lars Kai Strother, Stephen PLoS One Research Article Unlike mixtures consisting solely of non-Gaussian sources, mixtures including two or more Gaussian components cannot be separated using standard independent components analysis methods that are based on higher order statistics and independent observations. The mixed Independent Components Analysis/Principal Components Analysis (mixed ICA/PCA) model described here accommodates one or more Gaussian components in the independent components analysis model and uses principal components analysis to characterize contributions from this inseparable Gaussian subspace. Information theory can then be used to select from among potential model categories with differing numbers of Gaussian components. Based on simulation studies, the assumptions and approximations underlying the Akaike Information Criterion do not hold in this setting, even with a very large number of observations. Cross-validation is a suitable, though computationally intensive alternative for model selection. Application of the algorithm is illustrated using Fisher's iris data set and Howells' craniometric data set. Mixed ICA/PCA is of potential interest in any field of scientific investigation where the authenticity of blindly separated non-Gaussian sources might otherwise be questionable. Failure of the Akaike Information Criterion in model selection also has relevance in traditional independent components analysis where all sources are assumed non-Gaussian. Public Library of Science 2015-03-26 /pmc/articles/PMC4374758/ /pubmed/25811988 http://dx.doi.org/10.1371/journal.pone.0118877 Text en © 2015 Woods et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Woods, Roger P.
Hansen, Lars Kai
Strother, Stephen
How Many Separable Sources? Model Selection In Independent Components Analysis
title How Many Separable Sources? Model Selection In Independent Components Analysis
title_full How Many Separable Sources? Model Selection In Independent Components Analysis
title_fullStr How Many Separable Sources? Model Selection In Independent Components Analysis
title_full_unstemmed How Many Separable Sources? Model Selection In Independent Components Analysis
title_short How Many Separable Sources? Model Selection In Independent Components Analysis
title_sort how many separable sources? model selection in independent components analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374758/
https://www.ncbi.nlm.nih.gov/pubmed/25811988
http://dx.doi.org/10.1371/journal.pone.0118877
work_keys_str_mv AT woodsrogerp howmanyseparablesourcesmodelselectioninindependentcomponentsanalysis
AT hansenlarskai howmanyseparablesourcesmodelselectioninindependentcomponentsanalysis
AT strotherstephen howmanyseparablesourcesmodelselectioninindependentcomponentsanalysis