Cargando…

A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients

Multivariate prediction of human behavior from resting state data is gaining increasing popularity in the neuroimaging community, with far-reaching translational implications in neurology and psychiatry. However, the high dimensionality of neuroimaging data increases the risk of overfitting, calling...

Descripción completa

Detalles Bibliográficos
Autores principales: Calesella, Federico, Testolin, Alberto, De Filippo De Grazia, Michele, Zorzi, Marco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8058135/
https://www.ncbi.nlm.nih.gov/pubmed/33877469
http://dx.doi.org/10.1186/s40708-021-00129-1
_version_ 1783680969535913984
author Calesella, Federico
Testolin, Alberto
De Filippo De Grazia, Michele
Zorzi, Marco
author_facet Calesella, Federico
Testolin, Alberto
De Filippo De Grazia, Michele
Zorzi, Marco
author_sort Calesella, Federico
collection PubMed
description Multivariate prediction of human behavior from resting state data is gaining increasing popularity in the neuroimaging community, with far-reaching translational implications in neurology and psychiatry. However, the high dimensionality of neuroimaging data increases the risk of overfitting, calling for the use of dimensionality reduction methods to build robust predictive models. In this work, we assess the ability of four well-known dimensionality reduction techniques to extract relevant features from resting state functional connectivity matrices of stroke patients, which are then used to build a predictive model of the associated deficits based on cross-validated regularized regression. In particular, we investigated the prediction ability over different neuropsychological scores referring to language, verbal memory, and spatial memory domains. Principal Component Analysis (PCA) and Independent Component Analysis (ICA) were the two best methods at extracting representative features, followed by Dictionary Learning (DL) and Non-Negative Matrix Factorization (NNMF). Consistent with these results, features extracted by PCA and ICA were found to be the best predictors of the neuropsychological scores across all the considered cognitive domains. For each feature extraction method, we also examined the impact of the regularization method, model complexity (in terms of number of features that entered in the model) and quality of the maps that display predictive edges in the resting state networks. We conclude that PCA-based models, especially when combined with L1 (LASSO) regularization, provide optimal balance between prediction accuracy, model complexity, and interpretability. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40708-021-00129-1.
format Online
Article
Text
id pubmed-8058135
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-80581352021-05-05 A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients Calesella, Federico Testolin, Alberto De Filippo De Grazia, Michele Zorzi, Marco Brain Inform Research Multivariate prediction of human behavior from resting state data is gaining increasing popularity in the neuroimaging community, with far-reaching translational implications in neurology and psychiatry. However, the high dimensionality of neuroimaging data increases the risk of overfitting, calling for the use of dimensionality reduction methods to build robust predictive models. In this work, we assess the ability of four well-known dimensionality reduction techniques to extract relevant features from resting state functional connectivity matrices of stroke patients, which are then used to build a predictive model of the associated deficits based on cross-validated regularized regression. In particular, we investigated the prediction ability over different neuropsychological scores referring to language, verbal memory, and spatial memory domains. Principal Component Analysis (PCA) and Independent Component Analysis (ICA) were the two best methods at extracting representative features, followed by Dictionary Learning (DL) and Non-Negative Matrix Factorization (NNMF). Consistent with these results, features extracted by PCA and ICA were found to be the best predictors of the neuropsychological scores across all the considered cognitive domains. For each feature extraction method, we also examined the impact of the regularization method, model complexity (in terms of number of features that entered in the model) and quality of the maps that display predictive edges in the resting state networks. We conclude that PCA-based models, especially when combined with L1 (LASSO) regularization, provide optimal balance between prediction accuracy, model complexity, and interpretability. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40708-021-00129-1. Springer Berlin Heidelberg 2021-04-20 /pmc/articles/PMC8058135/ /pubmed/33877469 http://dx.doi.org/10.1186/s40708-021-00129-1 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Calesella, Federico
Testolin, Alberto
De Filippo De Grazia, Michele
Zorzi, Marco
A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title_full A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title_fullStr A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title_full_unstemmed A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title_short A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
title_sort comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivity data of stroke patients
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8058135/
https://www.ncbi.nlm.nih.gov/pubmed/33877469
http://dx.doi.org/10.1186/s40708-021-00129-1
work_keys_str_mv AT calesellafederico acomparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT testolinalberto acomparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT defilippodegraziamichele acomparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT zorzimarco acomparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT calesellafederico comparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT testolinalberto comparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT defilippodegraziamichele comparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients
AT zorzimarco comparisonoffeatureextractionmethodsforpredictionofneuropsychologicalscoresfromfunctionalconnectivitydataofstrokepatients