Cargando…

Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies

Mobile sensing data processed using machine learning models can passively and remotely assess mental health symptoms from the context of patients’ lives. Prior work has trained models using data from single longitudinal studies, collected from demographically homogeneous populations, over short time...

Descripción completa

Detalles Bibliográficos
Autores principales: Adler, Daniel A., Wang, Fei, Mohr, David C., Choudhury, Tanzeem
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045602/
https://www.ncbi.nlm.nih.gov/pubmed/35476787
http://dx.doi.org/10.1371/journal.pone.0266516
_version_ 1784695349973090304
author Adler, Daniel A.
Wang, Fei
Mohr, David C.
Choudhury, Tanzeem
author_facet Adler, Daniel A.
Wang, Fei
Mohr, David C.
Choudhury, Tanzeem
author_sort Adler, Daniel A.
collection PubMed
description Mobile sensing data processed using machine learning models can passively and remotely assess mental health symptoms from the context of patients’ lives. Prior work has trained models using data from single longitudinal studies, collected from demographically homogeneous populations, over short time periods, using a single data collection platform or mobile application. The generalizability of model performance across studies has not been assessed. This study presents a first analysis to understand if models trained using combined longitudinal study data to predict mental health symptoms generalize across current publicly available data. We combined data from the CrossCheck (individuals living with schizophrenia) and StudentLife (university students) studies. In addition to assessing generalizability, we explored if personalizing models to align mobile sensing data, and oversampling less-represented severe symptoms, improved model performance. Leave-one-subject-out cross-validation (LOSO-CV) results were reported. Two symptoms (sleep quality and stress) had similar question-response structures across studies and were used as outcomes to explore cross-dataset prediction. Models trained with combined data were more likely to be predictive (significant improvement over predicting training data mean) than models trained with single-study data. Expected model performance improved if the distance between training and validation feature distributions decreased using combined versus single-study data. Personalization aligned each LOSO-CV participant with training data, but only improved predicting CrossCheck stress. Oversampling significantly improved severe symptom classification sensitivity and positive predictive value, but decreased model specificity. Taken together, these results show that machine learning models trained on combined longitudinal study data may generalize across heterogeneous datasets. We encourage researchers to disseminate collected de-identified mobile sensing and mental health symptom data, and further standardize data types collected across studies to enable better assessment of model generalizability.
format Online
Article
Text
id pubmed-9045602
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-90456022022-04-28 Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies Adler, Daniel A. Wang, Fei Mohr, David C. Choudhury, Tanzeem PLoS One Research Article Mobile sensing data processed using machine learning models can passively and remotely assess mental health symptoms from the context of patients’ lives. Prior work has trained models using data from single longitudinal studies, collected from demographically homogeneous populations, over short time periods, using a single data collection platform or mobile application. The generalizability of model performance across studies has not been assessed. This study presents a first analysis to understand if models trained using combined longitudinal study data to predict mental health symptoms generalize across current publicly available data. We combined data from the CrossCheck (individuals living with schizophrenia) and StudentLife (university students) studies. In addition to assessing generalizability, we explored if personalizing models to align mobile sensing data, and oversampling less-represented severe symptoms, improved model performance. Leave-one-subject-out cross-validation (LOSO-CV) results were reported. Two symptoms (sleep quality and stress) had similar question-response structures across studies and were used as outcomes to explore cross-dataset prediction. Models trained with combined data were more likely to be predictive (significant improvement over predicting training data mean) than models trained with single-study data. Expected model performance improved if the distance between training and validation feature distributions decreased using combined versus single-study data. Personalization aligned each LOSO-CV participant with training data, but only improved predicting CrossCheck stress. Oversampling significantly improved severe symptom classification sensitivity and positive predictive value, but decreased model specificity. Taken together, these results show that machine learning models trained on combined longitudinal study data may generalize across heterogeneous datasets. We encourage researchers to disseminate collected de-identified mobile sensing and mental health symptom data, and further standardize data types collected across studies to enable better assessment of model generalizability. Public Library of Science 2022-04-27 /pmc/articles/PMC9045602/ /pubmed/35476787 http://dx.doi.org/10.1371/journal.pone.0266516 Text en © 2022 Adler et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Adler, Daniel A.
Wang, Fei
Mohr, David C.
Choudhury, Tanzeem
Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title_full Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title_fullStr Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title_full_unstemmed Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title_short Machine learning for passive mental health symptom prediction: Generalization across different longitudinal mobile sensing studies
title_sort machine learning for passive mental health symptom prediction: generalization across different longitudinal mobile sensing studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045602/
https://www.ncbi.nlm.nih.gov/pubmed/35476787
http://dx.doi.org/10.1371/journal.pone.0266516
work_keys_str_mv AT adlerdaniela machinelearningforpassivementalhealthsymptompredictiongeneralizationacrossdifferentlongitudinalmobilesensingstudies
AT wangfei machinelearningforpassivementalhealthsymptompredictiongeneralizationacrossdifferentlongitudinalmobilesensingstudies
AT mohrdavidc machinelearningforpassivementalhealthsymptompredictiongeneralizationacrossdifferentlongitudinalmobilesensingstudies
AT choudhurytanzeem machinelearningforpassivementalhealthsymptompredictiongeneralizationacrossdifferentlongitudinalmobilesensingstudies