Cargando…
Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data
Machine learning has increasingly been applied to classification of schizophrenia in neuroimaging research. However, direct replication studies and studies seeking to investigate generalizability are scarce. To address these issues, we assessed within‐site and between‐site generalizability of a mach...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268030/ https://www.ncbi.nlm.nih.gov/pubmed/31571320 http://dx.doi.org/10.1002/hbm.24797 |
_version_ | 1783541530201423872 |
---|---|
author | Cai, Xin‐Lu Xie, Dong‐Jie Madsen, Kristoffer H. Wang, Yong‐Ming Bögemann, Sophie Alida Cheung, Eric F. C. Møller, Arne Chan, Raymond C. K. |
author_facet | Cai, Xin‐Lu Xie, Dong‐Jie Madsen, Kristoffer H. Wang, Yong‐Ming Bögemann, Sophie Alida Cheung, Eric F. C. Møller, Arne Chan, Raymond C. K. |
author_sort | Cai, Xin‐Lu |
collection | PubMed |
description | Machine learning has increasingly been applied to classification of schizophrenia in neuroimaging research. However, direct replication studies and studies seeking to investigate generalizability are scarce. To address these issues, we assessed within‐site and between‐site generalizability of a machine learning classification framework which achieved excellent performance in a previous study using two independent resting‐state functional magnetic resonance imaging data sets collected from different sites and scanners. We established within‐site generalizability of the classification framework in the main data set using cross‐validation. Then, we trained a model in the main data set and investigated between‐site generalization in the validated data set using external validation. Finally, recognizing the poor between‐site generalization performance, we updated the unsupervised algorithm to investigate if transfer learning using additional unlabeled data were able to improve between‐site classification performance. Cross‐validation showed that the published classification procedure achieved an accuracy of 0.73 using majority voting across all selected components. External validation found a classification accuracy of 0.55 (not significant) and 0.70 (significant) using the direct and transfer learning procedures, respectively. The failure of direct generalization from one site to another demonstrates the limitation of within‐site cross‐validation and points toward the need to incorporate efforts to facilitate application of machine learning across multiple data sets. The improvement in performance with transfer learning highlights the importance of taking into account the properties of data when constructing predictive models across samples and sites. Our findings suggest that machine learning classification result based on a single study should be interpreted cautiously. |
format | Online Article Text |
id | pubmed-7268030 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-72680302020-06-12 Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data Cai, Xin‐Lu Xie, Dong‐Jie Madsen, Kristoffer H. Wang, Yong‐Ming Bögemann, Sophie Alida Cheung, Eric F. C. Møller, Arne Chan, Raymond C. K. Hum Brain Mapp Research Articles Machine learning has increasingly been applied to classification of schizophrenia in neuroimaging research. However, direct replication studies and studies seeking to investigate generalizability are scarce. To address these issues, we assessed within‐site and between‐site generalizability of a machine learning classification framework which achieved excellent performance in a previous study using two independent resting‐state functional magnetic resonance imaging data sets collected from different sites and scanners. We established within‐site generalizability of the classification framework in the main data set using cross‐validation. Then, we trained a model in the main data set and investigated between‐site generalization in the validated data set using external validation. Finally, recognizing the poor between‐site generalization performance, we updated the unsupervised algorithm to investigate if transfer learning using additional unlabeled data were able to improve between‐site classification performance. Cross‐validation showed that the published classification procedure achieved an accuracy of 0.73 using majority voting across all selected components. External validation found a classification accuracy of 0.55 (not significant) and 0.70 (significant) using the direct and transfer learning procedures, respectively. The failure of direct generalization from one site to another demonstrates the limitation of within‐site cross‐validation and points toward the need to incorporate efforts to facilitate application of machine learning across multiple data sets. The improvement in performance with transfer learning highlights the importance of taking into account the properties of data when constructing predictive models across samples and sites. Our findings suggest that machine learning classification result based on a single study should be interpreted cautiously. John Wiley & Sons, Inc. 2019-10-01 /pmc/articles/PMC7268030/ /pubmed/31571320 http://dx.doi.org/10.1002/hbm.24797 Text en © 2019 The Authors. Human Brain Mapping published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Cai, Xin‐Lu Xie, Dong‐Jie Madsen, Kristoffer H. Wang, Yong‐Ming Bögemann, Sophie Alida Cheung, Eric F. C. Møller, Arne Chan, Raymond C. K. Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title | Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title_full | Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title_fullStr | Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title_full_unstemmed | Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title_short | Generalizability of machine learning for classification of schizophrenia based on resting‐state functional MRI data |
title_sort | generalizability of machine learning for classification of schizophrenia based on resting‐state functional mri data |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268030/ https://www.ncbi.nlm.nih.gov/pubmed/31571320 http://dx.doi.org/10.1002/hbm.24797 |
work_keys_str_mv | AT caixinlu generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT xiedongjie generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT madsenkristofferh generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT wangyongming generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT bogemannsophiealida generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT cheungericfc generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT møllerarne generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata AT chanraymondck generalizabilityofmachinelearningforclassificationofschizophreniabasedonrestingstatefunctionalmridata |