Cargando…

Permutation inference for canonical correlation analysis

Canonical correlation analysis (CCA) has become a key tool for population neuroimaging, allowing investigation of associations between many imaging and non-imaging measurements. As age, sex and other variables are often a source of variability not of direct interest, previous work has used CCA on re...

Descripción completa

Detalles Bibliográficos
Autores principales: Winkler, Anderson M., Renaud, Olivier, Smith, Stephen M., Nichols, Thomas E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Academic Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7573815/
https://www.ncbi.nlm.nih.gov/pubmed/32603857
http://dx.doi.org/10.1016/j.neuroimage.2020.117065
_version_ 1783597516674039808
author Winkler, Anderson M.
Renaud, Olivier
Smith, Stephen M.
Nichols, Thomas E.
author_facet Winkler, Anderson M.
Renaud, Olivier
Smith, Stephen M.
Nichols, Thomas E.
author_sort Winkler, Anderson M.
collection PubMed
description Canonical correlation analysis (CCA) has become a key tool for population neuroimaging, allowing investigation of associations between many imaging and non-imaging measurements. As age, sex and other variables are often a source of variability not of direct interest, previous work has used CCA on residuals from a model that removes these effects, then proceeded directly to permutation inference. We show that a simple permutation test, as typically used to identify significant modes of shared variation on such data adjusted for nuisance variables, produces inflated error rates. The reason is that residualisation introduces dependencies among the observations that violate the exchangeability assumption. Even in the absence of nuisance variables, however, a simple permutation test for CCA also leads to excess error rates for all canonical correlations other than the first. The reason is that a simple permutation scheme does not ignore the variability already explained by previous canonical variables. Here we propose solutions for both problems: in the case of nuisance variables, we show that transforming the residuals to a lower dimensional basis where exchangeability holds results in a valid permutation test; for more general cases, with or without nuisance variables, we propose estimating the canonical correlations in a stepwise manner, removing at each iteration the variance already explained, while dealing with different number of variables in both sides. We also discuss how to address the multiplicity of tests, proposing an admissible test that is not conservative, and provide a complete algorithm for permutation inference for CCA.
format Online
Article
Text
id pubmed-7573815
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Academic Press
record_format MEDLINE/PubMed
spelling pubmed-75738152020-10-23 Permutation inference for canonical correlation analysis Winkler, Anderson M. Renaud, Olivier Smith, Stephen M. Nichols, Thomas E. Neuroimage Article Canonical correlation analysis (CCA) has become a key tool for population neuroimaging, allowing investigation of associations between many imaging and non-imaging measurements. As age, sex and other variables are often a source of variability not of direct interest, previous work has used CCA on residuals from a model that removes these effects, then proceeded directly to permutation inference. We show that a simple permutation test, as typically used to identify significant modes of shared variation on such data adjusted for nuisance variables, produces inflated error rates. The reason is that residualisation introduces dependencies among the observations that violate the exchangeability assumption. Even in the absence of nuisance variables, however, a simple permutation test for CCA also leads to excess error rates for all canonical correlations other than the first. The reason is that a simple permutation scheme does not ignore the variability already explained by previous canonical variables. Here we propose solutions for both problems: in the case of nuisance variables, we show that transforming the residuals to a lower dimensional basis where exchangeability holds results in a valid permutation test; for more general cases, with or without nuisance variables, we propose estimating the canonical correlations in a stepwise manner, removing at each iteration the variance already explained, while dealing with different number of variables in both sides. We also discuss how to address the multiplicity of tests, proposing an admissible test that is not conservative, and provide a complete algorithm for permutation inference for CCA. Academic Press 2020-10-15 /pmc/articles/PMC7573815/ /pubmed/32603857 http://dx.doi.org/10.1016/j.neuroimage.2020.117065 Text en http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Winkler, Anderson M.
Renaud, Olivier
Smith, Stephen M.
Nichols, Thomas E.
Permutation inference for canonical correlation analysis
title Permutation inference for canonical correlation analysis
title_full Permutation inference for canonical correlation analysis
title_fullStr Permutation inference for canonical correlation analysis
title_full_unstemmed Permutation inference for canonical correlation analysis
title_short Permutation inference for canonical correlation analysis
title_sort permutation inference for canonical correlation analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7573815/
https://www.ncbi.nlm.nih.gov/pubmed/32603857
http://dx.doi.org/10.1016/j.neuroimage.2020.117065
work_keys_str_mv AT winklerandersonm permutationinferenceforcanonicalcorrelationanalysis
AT renaudolivier permutationinferenceforcanonicalcorrelationanalysis
AT smithstephenm permutationinferenceforcanonicalcorrelationanalysis
AT nicholsthomase permutationinferenceforcanonicalcorrelationanalysis