Cargando…

Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability

INTRODUCTION: Machine learning (ML) approaches are a promising venue for identifying vocal markers of neuropsychiatric disorders, such as schizophrenia. While recent studies have shown that voice-based ML models can reliably predict diagnosis and clinical symptoms of schizophrenia, it is unclear to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Parola, A., Rybner, A., Jessen, E. T., Damsgaard Mortensen, M., Nyhus Larsen, S., Simonsen, A., Zhou, Y., Koelkebeck, K., Bliksted, V., Fusaroli, R.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Cambridge University Press 2023
Materias:	Abstract
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10596474/ http://dx.doi.org/10.1192/j.eurpsy.2023.444

_version_	1785125112986468352
author	Parola, A. Rybner, A. Jessen, E. T. Damsgaard Mortensen, M. Nyhus Larsen, S. Simonsen, A. Zhou, Y. Koelkebeck, K. Bliksted, V. Fusaroli, R.
author_facet	Parola, A. Rybner, A. Jessen, E. T. Damsgaard Mortensen, M. Nyhus Larsen, S. Simonsen, A. Zhou, Y. Koelkebeck, K. Bliksted, V. Fusaroli, R.
author_sort	Parola, A.
collection	PubMed
description	INTRODUCTION: Machine learning (ML) approaches are a promising venue for identifying vocal markers of neuropsychiatric disorders, such as schizophrenia. While recent studies have shown that voice-based ML models can reliably predict diagnosis and clinical symptoms of schizophrenia, it is unclear to what extent such ML markers generalize to new speech samples collected using a different task or in a different language: the assessment of generalization performance is however crucial for testing their clinical applicability. OBJECTIVES: In this research, we systematically assessed the generalizability of ML models across contexts and languages relying on a large cross-linguistic dataset of audio recordings of patients with schizophrenia and controls. METHODS: We trained ML models of vocal markers of schizophrenia on a large cross-linguistic dataset of audio recordings of 231 patients with schizophrenia and 238 matched controls (>4.000 recordings in Danish, German, Mandarin and Japanese). We developed a rigorous pipeline to minimize overfitting, including cross-validated training set and Mixture of Experts (MoE) models. We tested the generalizability of the ML models on: (i) different participants, speaking the same language (hold-out test set); (ii) different participants, speaking a different language. Finally, we compared the predictive performance of: (i) models trained on a single language (e.g., Danish) (ii) MoE models, i.e., ensemble of models (experts) trained on a single language whose predictions are combined using a weighted sum (iii) multi-language models trained on multiple languages (e.g., Danish and German). RESULTS: Model performance was comparable to state-of-the art findings (F1: 70%-80%) when trained and tested on participants speaking the same language (out-of-sample performance). Crucially, however, the ML models did not generalize well - showing a substantial decrease of performance (close to chance) - when trained in a language and tested on new languages (e.g., trained on Danish and tested on German). MoE and multi-language models showed a better increase of performance (F1: 55%-60%), but still far from those requested for achieving clinical applicability. CONCLUSIONS: Our results show that the cross-linguistic generalizability of ML models of vocal markers of schizophrenia is very limited. This is an issue if our first goal is to translate these vocal markers into effective clinical applications. We argue that more emphasis needs to be placed on collecting large open datasets to test the generalizability of voice-based ML models, for example, across different speech tasks or across the heterogeneous clinical profiles that characterize schizophrenia spectrum disorder. DISCLOSURE OF INTEREST: None Declared
format	Online Article Text
id	pubmed-10596474
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Cambridge University Press
record_format	MEDLINE/PubMed
spelling	pubmed-105964742023-10-25 Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability Parola, A. Rybner, A. Jessen, E. T. Damsgaard Mortensen, M. Nyhus Larsen, S. Simonsen, A. Zhou, Y. Koelkebeck, K. Bliksted, V. Fusaroli, R. Eur Psychiatry Abstract INTRODUCTION: Machine learning (ML) approaches are a promising venue for identifying vocal markers of neuropsychiatric disorders, such as schizophrenia. While recent studies have shown that voice-based ML models can reliably predict diagnosis and clinical symptoms of schizophrenia, it is unclear to what extent such ML markers generalize to new speech samples collected using a different task or in a different language: the assessment of generalization performance is however crucial for testing their clinical applicability. OBJECTIVES: In this research, we systematically assessed the generalizability of ML models across contexts and languages relying on a large cross-linguistic dataset of audio recordings of patients with schizophrenia and controls. METHODS: We trained ML models of vocal markers of schizophrenia on a large cross-linguistic dataset of audio recordings of 231 patients with schizophrenia and 238 matched controls (>4.000 recordings in Danish, German, Mandarin and Japanese). We developed a rigorous pipeline to minimize overfitting, including cross-validated training set and Mixture of Experts (MoE) models. We tested the generalizability of the ML models on: (i) different participants, speaking the same language (hold-out test set); (ii) different participants, speaking a different language. Finally, we compared the predictive performance of: (i) models trained on a single language (e.g., Danish) (ii) MoE models, i.e., ensemble of models (experts) trained on a single language whose predictions are combined using a weighted sum (iii) multi-language models trained on multiple languages (e.g., Danish and German). RESULTS: Model performance was comparable to state-of-the art findings (F1: 70%-80%) when trained and tested on participants speaking the same language (out-of-sample performance). Crucially, however, the ML models did not generalize well - showing a substantial decrease of performance (close to chance) - when trained in a language and tested on new languages (e.g., trained on Danish and tested on German). MoE and multi-language models showed a better increase of performance (F1: 55%-60%), but still far from those requested for achieving clinical applicability. CONCLUSIONS: Our results show that the cross-linguistic generalizability of ML models of vocal markers of schizophrenia is very limited. This is an issue if our first goal is to translate these vocal markers into effective clinical applications. We argue that more emphasis needs to be placed on collecting large open datasets to test the generalizability of voice-based ML models, for example, across different speech tasks or across the heterogeneous clinical profiles that characterize schizophrenia spectrum disorder. DISCLOSURE OF INTEREST: None Declared Cambridge University Press 2023-07-19 /pmc/articles/PMC10596474/ http://dx.doi.org/10.1192/j.eurpsy.2023.444 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Abstract Parola, A. Rybner, A. Jessen, E. T. Damsgaard Mortensen, M. Nyhus Larsen, S. Simonsen, A. Zhou, Y. Koelkebeck, K. Bliksted, V. Fusaroli, R. Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title	Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title_full	Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title_fullStr	Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title_full_unstemmed	Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title_short	Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
title_sort	vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability
topic	Abstract
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10596474/ http://dx.doi.org/10.1192/j.eurpsy.2023.444
work_keys_str_mv	AT parolaa vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT rybnera vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT jessenet vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT damsgaardmortensenm vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT nyhuslarsens vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT simonsena vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT zhouy vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT koelkebeckk vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT blikstedv vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability AT fusarolir vocalmarkersofschizophreniaassessingthegeneralizabilityofmachinelearningmodelsandtheirclinicalapplicability

Vocal markers of schizophrenia: assessing the generalizability of machine learning models and their clinical applicability

Ejemplares similares