Cargando…

Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics

In this critical review, we examine the application of predictive models, for example, classifiers, trained using machine learning (ML) to assist in interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and critically assess common practices. Our re...

Descripción completa

Detalles Bibliográficos
Autores principales: Cwiek, Andrew, Rajtmajer, Sarah M., Wyble, Bradley, Honavar, Vasant, Grossner, Emily, Hillary, Frank G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MIT Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8942606/
https://www.ncbi.nlm.nih.gov/pubmed/35350584
http://dx.doi.org/10.1162/netn_a_00212
_version_ 1784673337713098752
author Cwiek, Andrew
Rajtmajer, Sarah M.
Wyble, Bradley
Honavar, Vasant
Grossner, Emily
Hillary, Frank G.
author_facet Cwiek, Andrew
Rajtmajer, Sarah M.
Wyble, Bradley
Honavar, Vasant
Grossner, Emily
Hillary, Frank G.
author_sort Cwiek, Andrew
collection PubMed
description In this critical review, we examine the application of predictive models, for example, classifiers, trained using machine learning (ML) to assist in interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and critically assess common practices. Our review covers 250 studies published using ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Results for holdout (“lockbox”) performance was, on average, ∼13% less accurate than performance measured through cross-validation alone, highlighting the importance of lockbox data, which was included in only 16% of the studies. There was also a concerning lack of transparency across the key steps in training and evaluating predictive models. The summary of this literature underscores the importance of the use of a lockbox and highlights several methodological pitfalls that can be addressed by the imaging community. We argue that, ideally, studies are motivated both by the reproducibility and generalizability of findings as well as the potential clinical significance of the insights. We offer recommendations for principled integration of machine learning into the clinical neurosciences with the goal of advancing imaging biomarkers of brain disorders, understanding causative determinants for health risks, and parsing heterogeneous patient outcomes.
format Online
Article
Text
id pubmed-8942606
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MIT Press
record_format MEDLINE/PubMed
spelling pubmed-89426062022-03-28 Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics Cwiek, Andrew Rajtmajer, Sarah M. Wyble, Bradley Honavar, Vasant Grossner, Emily Hillary, Frank G. Netw Neurosci Perspective In this critical review, we examine the application of predictive models, for example, classifiers, trained using machine learning (ML) to assist in interpretation of functional neuroimaging data. Our primary goal is to summarize how ML is being applied and critically assess common practices. Our review covers 250 studies published using ML and resting-state functional MRI (fMRI) to infer various dimensions of the human functional connectome. Results for holdout (“lockbox”) performance was, on average, ∼13% less accurate than performance measured through cross-validation alone, highlighting the importance of lockbox data, which was included in only 16% of the studies. There was also a concerning lack of transparency across the key steps in training and evaluating predictive models. The summary of this literature underscores the importance of the use of a lockbox and highlights several methodological pitfalls that can be addressed by the imaging community. We argue that, ideally, studies are motivated both by the reproducibility and generalizability of findings as well as the potential clinical significance of the insights. We offer recommendations for principled integration of machine learning into the clinical neurosciences with the goal of advancing imaging biomarkers of brain disorders, understanding causative determinants for health risks, and parsing heterogeneous patient outcomes. MIT Press 2022-02-01 /pmc/articles/PMC8942606/ /pubmed/35350584 http://dx.doi.org/10.1162/netn_a_00212 Text en © 2021 Massachusetts Institute of Technology https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/.
spellingShingle Perspective
Cwiek, Andrew
Rajtmajer, Sarah M.
Wyble, Bradley
Honavar, Vasant
Grossner, Emily
Hillary, Frank G.
Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title_full Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title_fullStr Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title_full_unstemmed Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title_short Feeding the machine: Challenges to reproducible predictive modeling in resting-state connectomics
title_sort feeding the machine: challenges to reproducible predictive modeling in resting-state connectomics
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8942606/
https://www.ncbi.nlm.nih.gov/pubmed/35350584
http://dx.doi.org/10.1162/netn_a_00212
work_keys_str_mv AT cwiekandrew feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics
AT rajtmajersarahm feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics
AT wyblebradley feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics
AT honavarvasant feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics
AT grossneremily feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics
AT hillaryfrankg feedingthemachinechallengestoreproduciblepredictivemodelinginrestingstateconnectomics