Cargando…

Methodologies for Monitoring Mental Health on Twitter: Systematic Review

BACKGROUND: The use of social media data to predict mental health outcomes has the potential to allow for the continuous monitoring of mental health and well-being and provide timely information that can supplement traditional clinical assessments. However, it is crucial that the methodologies used...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Cara, Nina H, Maggio, Valerio, Davis, Oliver S P, Haworth, Claire M A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10203928/
https://www.ncbi.nlm.nih.gov/pubmed/37155236
http://dx.doi.org/10.2196/42734
_version_ 1785045735648002048
author Di Cara, Nina H
Maggio, Valerio
Davis, Oliver S P
Haworth, Claire M A
author_facet Di Cara, Nina H
Maggio, Valerio
Davis, Oliver S P
Haworth, Claire M A
author_sort Di Cara, Nina H
collection PubMed
description BACKGROUND: The use of social media data to predict mental health outcomes has the potential to allow for the continuous monitoring of mental health and well-being and provide timely information that can supplement traditional clinical assessments. However, it is crucial that the methodologies used to create models for this purpose are of high quality from both a mental health and machine learning perspective. Twitter has been a popular choice of social media because of the accessibility of its data, but access to big data sets is not a guarantee of robust results. OBJECTIVE: This study aims to review the current methodologies used in the literature for predicting mental health outcomes from Twitter data, with a focus on the quality of the underlying mental health data and the machine learning methods used. METHODS: A systematic search was performed across 6 databases, using keywords related to mental health disorders, algorithms, and social media. In total, 2759 records were screened, of which 164 (5.94%) papers were analyzed. Information about methodologies for data acquisition, preprocessing, model creation, and validation was collected, as well as information about replicability and ethical considerations. RESULTS: The 164 studies reviewed used 119 primary data sets. There were an additional 8 data sets identified that were not described in enough detail to include, and 6.1% (10/164) of the papers did not describe their data sets at all. Of these 119 data sets, only 16 (13.4%) had access to ground truth data (ie, known characteristics) about the mental health disorders of social media users. The other 86.6% (103/119) of data sets collected data by searching keywords or phrases, which may not be representative of patterns of Twitter use for those with mental health disorders. The annotation of mental health disorders for classification labels was variable, and 57.1% (68/119) of the data sets had no ground truth or clinical input on this annotation. Despite being a common mental health disorder, anxiety received little attention. CONCLUSIONS: The sharing of high-quality ground truth data sets is crucial for the development of trustworthy algorithms that have clinical and research utility. Further collaboration across disciplines and contexts is encouraged to better understand what types of predictions will be useful in supporting the management and identification of mental health disorders. A series of recommendations for researchers in this field and for the wider research community are made, with the aim of enhancing the quality and utility of future outputs.
format Online
Article
Text
id pubmed-10203928
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-102039282023-05-24 Methodologies for Monitoring Mental Health on Twitter: Systematic Review Di Cara, Nina H Maggio, Valerio Davis, Oliver S P Haworth, Claire M A J Med Internet Res Review BACKGROUND: The use of social media data to predict mental health outcomes has the potential to allow for the continuous monitoring of mental health and well-being and provide timely information that can supplement traditional clinical assessments. However, it is crucial that the methodologies used to create models for this purpose are of high quality from both a mental health and machine learning perspective. Twitter has been a popular choice of social media because of the accessibility of its data, but access to big data sets is not a guarantee of robust results. OBJECTIVE: This study aims to review the current methodologies used in the literature for predicting mental health outcomes from Twitter data, with a focus on the quality of the underlying mental health data and the machine learning methods used. METHODS: A systematic search was performed across 6 databases, using keywords related to mental health disorders, algorithms, and social media. In total, 2759 records were screened, of which 164 (5.94%) papers were analyzed. Information about methodologies for data acquisition, preprocessing, model creation, and validation was collected, as well as information about replicability and ethical considerations. RESULTS: The 164 studies reviewed used 119 primary data sets. There were an additional 8 data sets identified that were not described in enough detail to include, and 6.1% (10/164) of the papers did not describe their data sets at all. Of these 119 data sets, only 16 (13.4%) had access to ground truth data (ie, known characteristics) about the mental health disorders of social media users. The other 86.6% (103/119) of data sets collected data by searching keywords or phrases, which may not be representative of patterns of Twitter use for those with mental health disorders. The annotation of mental health disorders for classification labels was variable, and 57.1% (68/119) of the data sets had no ground truth or clinical input on this annotation. Despite being a common mental health disorder, anxiety received little attention. CONCLUSIONS: The sharing of high-quality ground truth data sets is crucial for the development of trustworthy algorithms that have clinical and research utility. Further collaboration across disciplines and contexts is encouraged to better understand what types of predictions will be useful in supporting the management and identification of mental health disorders. A series of recommendations for researchers in this field and for the wider research community are made, with the aim of enhancing the quality and utility of future outputs. JMIR Publications 2023-05-08 /pmc/articles/PMC10203928/ /pubmed/37155236 http://dx.doi.org/10.2196/42734 Text en ©Nina H Di Cara, Valerio Maggio, Oliver S P Davis, Claire M A Haworth. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 08.05.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Review
Di Cara, Nina H
Maggio, Valerio
Davis, Oliver S P
Haworth, Claire M A
Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title_full Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title_fullStr Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title_full_unstemmed Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title_short Methodologies for Monitoring Mental Health on Twitter: Systematic Review
title_sort methodologies for monitoring mental health on twitter: systematic review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10203928/
https://www.ncbi.nlm.nih.gov/pubmed/37155236
http://dx.doi.org/10.2196/42734
work_keys_str_mv AT dicaraninah methodologiesformonitoringmentalhealthontwittersystematicreview
AT maggiovalerio methodologiesformonitoringmentalhealthontwittersystematicreview
AT davisoliversp methodologiesformonitoringmentalhealthontwittersystematicreview
AT haworthclairema methodologiesformonitoringmentalhealthontwittersystematicreview