Cargando…
Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study
Machine learning (ML) methods are increasingly being used to predict pathologies and biological traits using neuroimaging data. Here controlling for confounds is essential to get unbiased estimates of generalization performance and to identify the features driving predictions. However, a systematic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903939/ http://dx.doi.org/10.1007/978-3-030-67670-4_1 |
_version_ | 1783654831968223232 |
---|---|
author | More, Shammi Eickhoff, Simon B. Caspers, Julian Patil, Kaustubh R. |
author_facet | More, Shammi Eickhoff, Simon B. Caspers, Julian Patil, Kaustubh R. |
author_sort | More, Shammi |
collection | PubMed |
description | Machine learning (ML) methods are increasingly being used to predict pathologies and biological traits using neuroimaging data. Here controlling for confounds is essential to get unbiased estimates of generalization performance and to identify the features driving predictions. However, a systematic evaluation of the advantages and disadvantages of available alternatives is lacking. This makes it difficult to compare results across studies and to build deployment quality models. Here, we evaluated two commonly used confound removal schemes–whole data confound regression (WDCR) and cross-validated confound regression (CVCR)–to understand their effectiveness and biases induced in generalization performance estimation. Additionally, we study the interaction of the confound removal schemes with Z-score normalization, a common practice in ML modelling. We applied eight combinations of confound removal schemes and normalization (pipelines) to decode sex from resting-state functional MRI (rfMRI) data while controlling for two confounds, brain size and age. We show that both schemes effectively remove linear univariate and multivariate confounding effects resulting in reduced model performance with CVCR providing better generalization estimates, i.e., closer to out-of-sample performance than WDCR. We found no effect of normalizing before or after confound removal. In the presence of dataset and confound shift, four tested confound removal procedures yielded mixed results, raising new questions. We conclude that CVCR is a better method to control for confounding effects in neuroimaging studies. We believe that our in-depth analyses shed light on choices associated with confound removal and hope that it generates more interest in this problem instrumental to numerous applications. |
format | Online Article Text |
id | pubmed-7903939 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-79039392021-02-25 Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study More, Shammi Eickhoff, Simon B. Caspers, Julian Patil, Kaustubh R. Machine Learning and Knowledge Discovery in Databases. Applied Data Science and Demo Track Article Machine learning (ML) methods are increasingly being used to predict pathologies and biological traits using neuroimaging data. Here controlling for confounds is essential to get unbiased estimates of generalization performance and to identify the features driving predictions. However, a systematic evaluation of the advantages and disadvantages of available alternatives is lacking. This makes it difficult to compare results across studies and to build deployment quality models. Here, we evaluated two commonly used confound removal schemes–whole data confound regression (WDCR) and cross-validated confound regression (CVCR)–to understand their effectiveness and biases induced in generalization performance estimation. Additionally, we study the interaction of the confound removal schemes with Z-score normalization, a common practice in ML modelling. We applied eight combinations of confound removal schemes and normalization (pipelines) to decode sex from resting-state functional MRI (rfMRI) data while controlling for two confounds, brain size and age. We show that both schemes effectively remove linear univariate and multivariate confounding effects resulting in reduced model performance with CVCR providing better generalization estimates, i.e., closer to out-of-sample performance than WDCR. We found no effect of normalizing before or after confound removal. In the presence of dataset and confound shift, four tested confound removal procedures yielded mixed results, raising new questions. We conclude that CVCR is a better method to control for confounding effects in neuroimaging studies. We believe that our in-depth analyses shed light on choices associated with confound removal and hope that it generates more interest in this problem instrumental to numerous applications. 2021-01-30 /pmc/articles/PMC7903939/ http://dx.doi.org/10.1007/978-3-030-67670-4_1 Text en © The Author(s) 2021 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. |
spellingShingle | Article More, Shammi Eickhoff, Simon B. Caspers, Julian Patil, Kaustubh R. Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title | Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title_full | Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title_fullStr | Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title_full_unstemmed | Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title_short | Confound Removal and Normalization in Practice: A Neuroimaging Based Sex Prediction Case Study |
title_sort | confound removal and normalization in practice: a neuroimaging based sex prediction case study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7903939/ http://dx.doi.org/10.1007/978-3-030-67670-4_1 |
work_keys_str_mv | AT moreshammi confoundremovalandnormalizationinpracticeaneuroimagingbasedsexpredictioncasestudy AT eickhoffsimonb confoundremovalandnormalizationinpracticeaneuroimagingbasedsexpredictioncasestudy AT caspersjulian confoundremovalandnormalizationinpracticeaneuroimagingbasedsexpredictioncasestudy AT patilkaustubhr confoundremovalandnormalizationinpracticeaneuroimagingbasedsexpredictioncasestudy |