Cargando…

Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants

BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Lung-Hao, Chen, Chang-Hao, Chang, Wan-Chen, Lee, Po-Lei, Shyu, Kuo-Kai, Chen, Mu-Hong, Hsu, Ju-Wei, Bai, Ya-Mei, Su, Tung-Ping, Tu, Pei-Chi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cambridge University Press 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/ https://www.ncbi.nlm.nih.gov/pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248

_version_	1784640474216136704
author	Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi
author_facet	Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi
author_sort	Lee, Lung-Hao
collection	PubMed
description	BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia.
format	Online Article Text
id	pubmed-8792868
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Cambridge University Press
record_format	MEDLINE/PubMed
spelling	pubmed-87928682022-02-09 Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi Eur Psychiatry Research Article BACKGROUND: Support vector machines (SVMs) based on brain-wise functional connectivity (FC) have been widely adopted for single-subject prediction of patients with schizophrenia, but most of them had small sample size. This study aimed to evaluate the performance of SVMs based on a large single-site dataset and investigate the effects of demographic homogeneity and training sample size on classification accuracy. METHODS: The resting functional Magnetic Resonance Imaging (fMRI) dataset comprised 220 patients with schizophrenia and 220 healthy controls. Brain-wise FCs was calculated for each participant and linear SVMs were developed for automatic classification of patients and controls. First, we evaluated the SVMs based on all participants and homogeneous subsamples of men, women, younger (18–30 years), and older (31–50 years) participants by 10-fold nested cross-validation. Then, we hold out a fixed test set of 40 participants (20 patients and 20 controls) and evaluated the SVMs based on incremental training sample sizes (N = 40, 80, …, 400). RESULTS: We found that the SVMs based on all participants had accuracy of 85.05%. The SVMs based on male, female, young, and older participants yielded accuracy of 84.66, 81.56, 80.50, and 86.13%, respectively. Although the SVMs based on older subsamples had better performance than those based on all participants, they generalized poorly to younger participants (77.24%). For incremental training sizes, the classification accuracy increased stepwise from 72.6 to 83.3%, with >80% accuracy achieved with sample size >240. CONCLUSIONS: The findings indicate that SVMs based on a large dataset yield high classification accuracy and establish models using a large sample size with heterogeneous properties are recommended for single subject prediction of schizophrenia. Cambridge University Press 2021-12-23 /pmc/articles/PMC8792868/ /pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc-sa/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is used to distribute the re-used or adapted article and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use.
spellingShingle	Research Article Lee, Lung-Hao Chen, Chang-Hao Chang, Wan-Chen Lee, Po-Lei Shyu, Kuo-Kai Chen, Mu-Hong Hsu, Ju-Wei Bai, Ya-Mei Su, Tung-Ping Tu, Pei-Chi Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title	Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_full	Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_fullStr	Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_full_unstemmed	Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_short	Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
title_sort	evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792868/ https://www.ncbi.nlm.nih.gov/pubmed/34937587 http://dx.doi.org/10.1192/j.eurpsy.2021.2248
work_keys_str_mv	AT leelunghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT chenchanghao evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT changwanchen evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT leepolei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT shyukuokai evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT chenmuhong evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT hsujuwei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT baiyamei evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT sutungping evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants AT tupeichi evaluatingtheperformanceofmachinelearningmodelsforautomaticdiagnosisofpatientswithschizophreniabasedonasinglesitedatasetof440participants

Evaluating the performance of machine learning models for automatic diagnosis of patients with schizophrenia based on a single site dataset of 440 participants

Ejemplares similares