Cargando…

A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model

With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety. These types of systems, which are mainly de...

Descripción completa

Detalles Bibliográficos
Autores principales: Zeberga, Kamil, Attique, Muhammad, Shah, Babar, Ali, Farman, Jembre, Yalew Zelalem, Chung, Tae-Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8913054/
https://www.ncbi.nlm.nih.gov/pubmed/35281185
http://dx.doi.org/10.1155/2022/7893775
_version_ 1784667327144394752
author Zeberga, Kamil
Attique, Muhammad
Shah, Babar
Ali, Farman
Jembre, Yalew Zelalem
Chung, Tae-Sun
author_facet Zeberga, Kamil
Attique, Muhammad
Shah, Babar
Ali, Farman
Jembre, Yalew Zelalem
Chung, Tae-Sun
author_sort Zeberga, Kamil
collection PubMed
description With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety. These types of systems, which are mainly dependent on machine learning techniques, must be able to deal with obtaining the semantic and syntactic meaning of texts posted by users on social media. The data generated by users on social media contains unstructured and unpredictable content. Several systems based on machine learning and social media platforms have recently been introduced to identify health-related problems. However, the text representation and deep learning techniques employed provide only limited information and knowledge about the different texts posted by users. This is owing to a lack of long-term dependencies between each word in the entire text and a lack of proper exploitation of recent deep learning schemes. In this paper, we propose a novel framework to efficiently and effectively identify depression and anxiety-related posts while maintaining the contextual and semantic meaning of the words used in the whole corpus when applying bidirectional encoder representations from transformers (BERT). In addition, we propose a knowledge distillation technique, which is a recent technique for transferring knowledge from a large pretrained model (BERT) to a smaller model to boost performance and accuracy. We also devised our own data collection framework from Reddit and Twitter, which are the most common social media sites. Finally, we employed word2vec and BERT with Bi-LSTM to effectively analyze and detect depression and anxiety signs from social media posts. Our system surpasses other state-of-the-art methods and achieves an accuracy of 98% using the knowledge distillation technique.
format Online
Article
Text
id pubmed-8913054
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89130542022-03-11 A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model Zeberga, Kamil Attique, Muhammad Shah, Babar Ali, Farman Jembre, Yalew Zelalem Chung, Tae-Sun Comput Intell Neurosci Research Article With the current advancement in the Internet, there has been a growing demand for building intelligent and smart systems that can efficiently address the detection of health-related problems on social media, such as the detection of depression and anxiety. These types of systems, which are mainly dependent on machine learning techniques, must be able to deal with obtaining the semantic and syntactic meaning of texts posted by users on social media. The data generated by users on social media contains unstructured and unpredictable content. Several systems based on machine learning and social media platforms have recently been introduced to identify health-related problems. However, the text representation and deep learning techniques employed provide only limited information and knowledge about the different texts posted by users. This is owing to a lack of long-term dependencies between each word in the entire text and a lack of proper exploitation of recent deep learning schemes. In this paper, we propose a novel framework to efficiently and effectively identify depression and anxiety-related posts while maintaining the contextual and semantic meaning of the words used in the whole corpus when applying bidirectional encoder representations from transformers (BERT). In addition, we propose a knowledge distillation technique, which is a recent technique for transferring knowledge from a large pretrained model (BERT) to a smaller model to boost performance and accuracy. We also devised our own data collection framework from Reddit and Twitter, which are the most common social media sites. Finally, we employed word2vec and BERT with Bi-LSTM to effectively analyze and detect depression and anxiety signs from social media posts. Our system surpasses other state-of-the-art methods and achieves an accuracy of 98% using the knowledge distillation technique. Hindawi 2022-03-03 /pmc/articles/PMC8913054/ /pubmed/35281185 http://dx.doi.org/10.1155/2022/7893775 Text en Copyright © 2022 Kamil Zeberga et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zeberga, Kamil
Attique, Muhammad
Shah, Babar
Ali, Farman
Jembre, Yalew Zelalem
Chung, Tae-Sun
A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title_full A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title_fullStr A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title_full_unstemmed A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title_short A Novel Text Mining Approach for Mental Health Prediction Using Bi-LSTM and BERT Model
title_sort novel text mining approach for mental health prediction using bi-lstm and bert model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8913054/
https://www.ncbi.nlm.nih.gov/pubmed/35281185
http://dx.doi.org/10.1155/2022/7893775
work_keys_str_mv AT zebergakamil anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT attiquemuhammad anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT shahbabar anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT alifarman anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT jembreyalewzelalem anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT chungtaesun anoveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT zebergakamil noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT attiquemuhammad noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT shahbabar noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT alifarman noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT jembreyalewzelalem noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel
AT chungtaesun noveltextminingapproachformentalhealthpredictionusingbilstmandbertmodel