Cargando…

Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants

BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Lung-Hao, Chen, Chang-Hao, Chang, Wan-Chen, Lee, Po-Lei, Shyu, Kuo-Kai, Chen, Mu-Hong, Hsu, Ju-Wei, Bai, Ya-Mei, Su, Tung-Ping, Tu, Pei-Chi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cambridge University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/
https://www.ncbi.nlm.nih.gov/pubmed/34937587
http://dx.doi.org/10.1192/j.eurpsy.2021.2248
_version_ 1784640474216136704
author Lee, Lung-Hao
Chen, Chang-Hao
Chang, Wan-Chen
Lee, Po-Lei
Shyu, Kuo-Kai
Chen, Mu-Hong
Hsu, Ju-Wei
Bai, Ya-Mei
Su, Tung-Ping
Tu, Pei-Chi
author_facet Lee, Lung-Hao
Chen, Chang-Hao
Chang, Wan-Chen
Lee, Po-Lei
Shyu, Kuo-Kai
Chen, Mu-Hong
Hsu, Ju-Wei
Bai, Ya-Mei
Su, Tung-Ping
Tu, Pei-Chi
author_sort Lee, Lung-Hao
collection PubMed
description BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia.
format Online
Article
Text
id pubmed-8792868
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-87928682022-02-09 Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi Eur Psychiatry Research Article BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia. Cambridge University Press 2021-12-23 /pmc/articles/PMC8792868/ /pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc-sa/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
spellingShingle Research Article
Lee, Lung-Hao
Chen, Chang-Hao
Chang, Wan-Chen
Lee, Po-Lei
Shyu, Kuo-Kai
Chen, Mu-Hong
Hsu, Ju-Wei
Bai, Ya-Mei
Su, Tung-Ping
Tu, Pei-Chi
Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_full Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_fullStr Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_full_unstemmed Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_short Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_sort evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/
https://www.ncbi.nlm.nih.gov/pubmed/34937587
http://dx.doi.org/10.1192/j.eurpsy.2021.2248
work_keys_str_mv AT leelunghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT chenchanghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT changwanchen evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT leepolei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT shyukuokai evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT chenmuhong evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT hsujuwei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT baiyamei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT sutungping evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants
AT tupeichi evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants