Cargando…

Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study

BACKGROUND: Mental illness affects a significant portion of the worldwide population. Online mental health forums can provide a supportive environment for those afflicted and also generate a large amount of data that can be mined to predict mental health states using machine learning methods. OBJECT...

Descripción completa

Detalles Bibliográficos
Autores principales: Howard, Derek, Maslej, Marta M, Lee, Justin, Ritchie, Jacob, Woollard, Geoffrey, French, Leon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254287/
https://www.ncbi.nlm.nih.gov/pubmed/32401222
http://dx.doi.org/10.2196/15371
_version_ 1783539509612249088
author Howard, Derek
Maslej, Marta M
Lee, Justin
Ritchie, Jacob
Woollard, Geoffrey
French, Leon
author_facet Howard, Derek
Maslej, Marta M
Lee, Justin
Ritchie, Jacob
Woollard, Geoffrey
French, Leon
author_sort Howard, Derek
collection PubMed
description BACKGROUND: Mental illness affects a significant portion of the worldwide population. Online mental health forums can provide a supportive environment for those afflicted and also generate a large amount of data that can be mined to predict mental health states using machine learning methods. OBJECTIVE: This study aimed to benchmark multiple methods of text feature representation for social media posts and compare their downstream use with automated machine learning (AutoML) tools. We tested on datasets that contain posts labeled for perceived suicide risk or moderator attention in the context of self-harm. Specifically, we assessed the ability of the methods to prioritize posts that a moderator would identify for immediate response. METHODS: We used 1588 labeled posts from the Computational Linguistics and Clinical Psychology (CLPsych) 2017 shared task collected from the Reachout.com forum. Posts were represented using lexicon-based tools, including Valence Aware Dictionary and sEntiment Reasoner, Empath, and Linguistic Inquiry and Word Count, and also using pretrained artificial neural network models, including DeepMoji, Universal Sentence Encoder, and Generative Pretrained Transformer-1 (GPT-1). We used Tree-based Optimization Tool and Auto-Sklearn as AutoML tools to generate classifiers to triage the posts. RESULTS: The top-performing system used features derived from the GPT-1 model, which was fine-tuned on over 150,000 unlabeled posts from Reachout.com. Our top system had a macroaveraged F1 score of 0.572, providing a new state-of-the-art result on the CLPsych 2017 task. This was achieved without additional information from metadata or preceding posts. Error analyses revealed that this top system often misses expressions of hopelessness. In addition, we have presented visualizations that aid in the understanding of the learned classifiers. CONCLUSIONS: In this study, we found that transfer learning is an effective strategy for predicting risk with relatively little labeled data and noted that fine-tuning of pretrained language models provides further gains when large amounts of unlabeled text are available.
format Online
Article
Text
id pubmed-7254287
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-72542872020-06-03 Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study Howard, Derek Maslej, Marta M Lee, Justin Ritchie, Jacob Woollard, Geoffrey French, Leon J Med Internet Res Original Paper BACKGROUND: Mental illness affects a significant portion of the worldwide population. Online mental health forums can provide a supportive environment for those afflicted and also generate a large amount of data that can be mined to predict mental health states using machine learning methods. OBJECTIVE: This study aimed to benchmark multiple methods of text feature representation for social media posts and compare their downstream use with automated machine learning (AutoML) tools. We tested on datasets that contain posts labeled for perceived suicide risk or moderator attention in the context of self-harm. Specifically, we assessed the ability of the methods to prioritize posts that a moderator would identify for immediate response. METHODS: We used 1588 labeled posts from the Computational Linguistics and Clinical Psychology (CLPsych) 2017 shared task collected from the Reachout.com forum. Posts were represented using lexicon-based tools, including Valence Aware Dictionary and sEntiment Reasoner, Empath, and Linguistic Inquiry and Word Count, and also using pretrained artificial neural network models, including DeepMoji, Universal Sentence Encoder, and Generative Pretrained Transformer-1 (GPT-1). We used Tree-based Optimization Tool and Auto-Sklearn as AutoML tools to generate classifiers to triage the posts. RESULTS: The top-performing system used features derived from the GPT-1 model, which was fine-tuned on over 150,000 unlabeled posts from Reachout.com. Our top system had a macroaveraged F1 score of 0.572, providing a new state-of-the-art result on the CLPsych 2017 task. This was achieved without additional information from metadata or preceding posts. Error analyses revealed that this top system often misses expressions of hopelessness. In addition, we have presented visualizations that aid in the understanding of the learned classifiers. CONCLUSIONS: In this study, we found that transfer learning is an effective strategy for predicting risk with relatively little labeled data and noted that fine-tuning of pretrained language models provides further gains when large amounts of unlabeled text are available. JMIR Publications 2020-05-13 /pmc/articles/PMC7254287/ /pubmed/32401222 http://dx.doi.org/10.2196/15371 Text en ©Derek Howard, Marta M Maslej, Justin Lee, Jacob Ritchie, Geoffrey Woollard, Leon French. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 13.05.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Howard, Derek
Maslej, Marta M
Lee, Justin
Ritchie, Jacob
Woollard, Geoffrey
French, Leon
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title_full Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title_fullStr Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title_full_unstemmed Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title_short Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
title_sort transfer learning for risk classification of social media posts: model evaluation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7254287/
https://www.ncbi.nlm.nih.gov/pubmed/32401222
http://dx.doi.org/10.2196/15371
work_keys_str_mv AT howardderek transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy
AT maslejmartam transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy
AT leejustin transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy
AT ritchiejacob transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy
AT woollardgeoffrey transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy
AT frenchleon transferlearningforriskclassificationofsocialmediapostsmodelevaluationstudy