Cargando…
Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/ https://www.ncbi.nlm.nih.gov/pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248 |
_version_ | 1784640474216136704 |
---|---|
author | Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi |
author_facet | Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi |
author_sort | Lee, Lung-Hao |
collection | PubMed |
description | BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia. |
format | Online Article Text |
id | pubmed-8792868 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87928682022-02-09 Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi Eur Psychiatry Research Article BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia. Cambridge University Press 2021-12-23 /pmc/articles/PMC8792868/ /pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc-sa/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use. |
spellingShingle | Research Article Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants |
title | Evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
title_full | Evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
title_fullStr | Evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
title_full_unstemmed | Evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
title_short | Evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
title_sort | evaluating the performance of machine learning models for automatic diagnosis
of patients with schizophrenia based on a single site dataset of 440
participants |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/ https://www.ncbi.nlm.nih.gov/pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248 |
work_keys_str_mv | AT leelunghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT chenchanghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT changwanchen evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT leepolei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT shyukuokai evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT chenmuhong evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT hsujuwei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT baiyamei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT sutungping evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT tupeichi evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants |