Cargando…

Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques

BACKGROUND: There are a myriad of language cues that indicate depression in written texts, and natural language processing (NLP) researchers have proven the ability of machine learning and deep learning approaches to detect these cues. However, to date, these approaches bridging NLP and the domain o...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kabir, Muhammad Khubayeeb, Islam, Maisha, Kabir, Anika Nahian Binte, Haque, Adiba, Rhaman, Md Khalilur
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2022
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9557762/ https://www.ncbi.nlm.nih.gov/pubmed/36169989 http://dx.doi.org/10.2196/36118

_version_	1784807297107623936
author	Kabir, Muhammad Khubayeeb Islam, Maisha Kabir, Anika Nahian Binte Haque, Adiba Rhaman, Md Khalilur
author_facet	Kabir, Muhammad Khubayeeb Islam, Maisha Kabir, Anika Nahian Binte Haque, Adiba Rhaman, Md Khalilur
author_sort	Kabir, Muhammad Khubayeeb
collection	PubMed
description	BACKGROUND: There are a myriad of language cues that indicate depression in written texts, and natural language processing (NLP) researchers have proven the ability of machine learning and deep learning approaches to detect these cues. However, to date, these approaches bridging NLP and the domain of mental health for Bengali literature are not comprehensive. The Bengali-speaking population can express emotions in their native language in greater detail. OBJECTIVE: Our goal is to detect the severity of depression using Bengali texts by generating a novel Bengali corpus of depressive posts. We collaborated with mental health experts to generate a clinically sound labeling scheme and an annotated corpus to train machine learning and deep learning models. METHODS: We conducted a study using Bengali text-based data from blogs and open source platforms. We constructed a procedure for annotated corpus generation and extraction of textual information from Bengali literature for predictive analysis. We developed our own structured data set and designed a clinically sound labeling scheme with the help of mental health professionals, adhering to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) during the process. We used 5 machine learning models for detecting the severity of depression: kernel support vector machine (SVM), random forest, logistic regression K-nearest neighbor (KNN), and complement naive Bayes (NB). For the deep learning approach, we used long short-term memory (LSTM) units and gated recurrent units (GRUs) coupled with convolutional blocks or self-attention layers. Finally, we aimed for enhanced outcomes by using state-of-the-art pretrained language models. RESULTS: The independent recurrent neural network (RNN) models yielded the highest accuracies and weighted F1 scores. GRUs, in particular, produced 81% accuracy. The hybrid architectures could not surpass the RNNs in terms of performance. Kernel SVM with term frequency–inverse document frequency (TF-IDF) embeddings generated 78% accuracy on test data. We used validation and training loss curves to observe and report the performance of our architectures. Overall, the number of available data remained the limitation of our experiment. CONCLUSIONS: The findings from our experimental setup indicate that machine learning and deep learning models are fairly capable of assessing the severity of mental health issues from texts. For the future, we suggest more research endeavors to increase the volume of Bengali text data, in particular, so that modern architectures reach improved generalization capability.
format	Online Article Text
id	pubmed-9557762
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-95577622022-10-14 Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques Kabir, Muhammad Khubayeeb Islam, Maisha Kabir, Anika Nahian Binte Haque, Adiba Rhaman, Md Khalilur JMIR Form Res Original Paper BACKGROUND: There are a myriad of language cues that indicate depression in written texts, and natural language processing (NLP) researchers have proven the ability of machine learning and deep learning approaches to detect these cues. However, to date, these approaches bridging NLP and the domain of mental health for Bengali literature are not comprehensive. The Bengali-speaking population can express emotions in their native language in greater detail. OBJECTIVE: Our goal is to detect the severity of depression using Bengali texts by generating a novel Bengali corpus of depressive posts. We collaborated with mental health experts to generate a clinically sound labeling scheme and an annotated corpus to train machine learning and deep learning models. METHODS: We conducted a study using Bengali text-based data from blogs and open source platforms. We constructed a procedure for annotated corpus generation and extraction of textual information from Bengali literature for predictive analysis. We developed our own structured data set and designed a clinically sound labeling scheme with the help of mental health professionals, adhering to the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) during the process. We used 5 machine learning models for detecting the severity of depression: kernel support vector machine (SVM), random forest, logistic regression K-nearest neighbor (KNN), and complement naive Bayes (NB). For the deep learning approach, we used long short-term memory (LSTM) units and gated recurrent units (GRUs) coupled with convolutional blocks or self-attention layers. Finally, we aimed for enhanced outcomes by using state-of-the-art pretrained language models. RESULTS: The independent recurrent neural network (RNN) models yielded the highest accuracies and weighted F1 scores. GRUs, in particular, produced 81% accuracy. The hybrid architectures could not surpass the RNNs in terms of performance. Kernel SVM with term frequency–inverse document frequency (TF-IDF) embeddings generated 78% accuracy on test data. We used validation and training loss curves to observe and report the performance of our architectures. Overall, the number of available data remained the limitation of our experiment. CONCLUSIONS: The findings from our experimental setup indicate that machine learning and deep learning models are fairly capable of assessing the severity of mental health issues from texts. For the future, we suggest more research endeavors to increase the volume of Bengali text data, in particular, so that modern architectures reach improved generalization capability. JMIR Publications 2022-09-28 /pmc/articles/PMC9557762/ /pubmed/36169989 http://dx.doi.org/10.2196/36118 Text en ©Muhammad Khubayeeb Kabir, Maisha Islam, Anika Nahian Binte Kabir, Adiba Haque, Md Khalilur Rhaman. Originally published in JMIR Formative Research (https://formative.jmir.org), 28.09.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle	Original Paper Kabir, Muhammad Khubayeeb Islam, Maisha Kabir, Anika Nahian Binte Haque, Adiba Rhaman, Md Khalilur Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title	Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title_full	Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title_fullStr	Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title_full_unstemmed	Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title_short	Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques
title_sort	detection of depression severity using bengali social media posts on mental health: study using natural language processing techniques
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9557762/ https://www.ncbi.nlm.nih.gov/pubmed/36169989 http://dx.doi.org/10.2196/36118
work_keys_str_mv	AT kabirmuhammadkhubayeeb detectionofdepressionseverityusingbengalisocialmediapostsonmentalhealthstudyusingnaturallanguageprocessingtechniques AT islammaisha detectionofdepressionseverityusingbengalisocialmediapostsonmentalhealthstudyusingnaturallanguageprocessingtechniques AT kabiranikanahianbinte detectionofdepressionseverityusingbengalisocialmediapostsonmentalhealthstudyusingnaturallanguageprocessingtechniques AT haqueadiba detectionofdepressionseverityusingbengalisocialmediapostsonmentalhealthstudyusingnaturallanguageprocessingtechniques AT rhamanmdkhalilur detectionofdepressionseverityusingbengalisocialmediapostsonmentalhealthstudyusingnaturallanguageprocessingtechniques

Detection of Depression Severity Using Bengali Social Media Posts on Mental Health: Study Using Natural Language Processing Techniques

Ejemplares similares