Cargando…

Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons

INTRODUCTION: In recent years, research has used psycholinguistic features in public discourse, networking behaviors on social media and profile information to train models for depression detection. However, the most widely adopted approach for the extraction of psycholinguistic features is to use t...

Descripción completa

Detalles Bibliográficos
Autores principales: Lyu, Sihua, Ren, Xiaopeng, Du, Yihua, Zhao, Nan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9947407/
https://www.ncbi.nlm.nih.gov/pubmed/36846219
http://dx.doi.org/10.3389/fpsyt.2023.1121583
_version_ 1784892546700279808
author Lyu, Sihua
Ren, Xiaopeng
Du, Yihua
Zhao, Nan
author_facet Lyu, Sihua
Ren, Xiaopeng
Du, Yihua
Zhao, Nan
author_sort Lyu, Sihua
collection PubMed
description INTRODUCTION: In recent years, research has used psycholinguistic features in public discourse, networking behaviors on social media and profile information to train models for depression detection. However, the most widely adopted approach for the extraction of psycholinguistic features is to use the Linguistic Inquiry Word Count (LIWC) dictionary and various affective lexicons. Other features related to cultural factors and suicide risk have not been explored. Moreover, the use of social networking behavioral features and profile features would limit the generalizability of the model. Therefore, our study aimed at building a prediction model of depression for text-only social media data through a wider range of possible linguistic features related to depression, and illuminate the relationship between linguistic expression and depression. METHODS: We collected 789 users’ depression scores as well as their past posts on Weibo, and extracted a total of 117 lexical features via Simplified Chinese Linguistic Inquiry Word Count, Chinese Suicide Dictionary, Chinese Version of Moral Foundations Dictionary, Chinese Version of Moral Motivation Dictionary, and Chinese Individualism/Collectivism Dictionary. RESULTS: Results showed that all the dictionaries contributed to the prediction. The best performing model occurred with linear regression, with the Pearson correlation coefficient between predicted values and self-reported values was 0.33, the R-squared was 0.10, and the split-half reliability was 0.75. DISCUSSION: This study did not only develop a predictive model applicable to text-only social media data, but also demonstrated the importance taking cultural psychological factors and suicide related expressions into consideration in the calculation of word frequency. Our research provided a more comprehensive understanding of how lexicons related to cultural psychology and suicide risk were associated with depression, and could contribute to the recognition of depression.
format Online
Article
Text
id pubmed-9947407
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99474072023-02-24 Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons Lyu, Sihua Ren, Xiaopeng Du, Yihua Zhao, Nan Front Psychiatry Psychiatry INTRODUCTION: In recent years, research has used psycholinguistic features in public discourse, networking behaviors on social media and profile information to train models for depression detection. However, the most widely adopted approach for the extraction of psycholinguistic features is to use the Linguistic Inquiry Word Count (LIWC) dictionary and various affective lexicons. Other features related to cultural factors and suicide risk have not been explored. Moreover, the use of social networking behavioral features and profile features would limit the generalizability of the model. Therefore, our study aimed at building a prediction model of depression for text-only social media data through a wider range of possible linguistic features related to depression, and illuminate the relationship between linguistic expression and depression. METHODS: We collected 789 users’ depression scores as well as their past posts on Weibo, and extracted a total of 117 lexical features via Simplified Chinese Linguistic Inquiry Word Count, Chinese Suicide Dictionary, Chinese Version of Moral Foundations Dictionary, Chinese Version of Moral Motivation Dictionary, and Chinese Individualism/Collectivism Dictionary. RESULTS: Results showed that all the dictionaries contributed to the prediction. The best performing model occurred with linear regression, with the Pearson correlation coefficient between predicted values and self-reported values was 0.33, the R-squared was 0.10, and the split-half reliability was 0.75. DISCUSSION: This study did not only develop a predictive model applicable to text-only social media data, but also demonstrated the importance taking cultural psychological factors and suicide related expressions into consideration in the calculation of word frequency. Our research provided a more comprehensive understanding of how lexicons related to cultural psychology and suicide risk were associated with depression, and could contribute to the recognition of depression. Frontiers Media S.A. 2023-02-09 /pmc/articles/PMC9947407/ /pubmed/36846219 http://dx.doi.org/10.3389/fpsyt.2023.1121583 Text en Copyright © 2023 Lyu, Ren, Du and Zhao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Lyu, Sihua
Ren, Xiaopeng
Du, Yihua
Zhao, Nan
Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title_full Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title_fullStr Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title_full_unstemmed Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title_short Detecting depression of Chinese microblog users via text analysis: Combining Linguistic Inquiry Word Count (LIWC) with culture and suicide related lexicons
title_sort detecting depression of chinese microblog users via text analysis: combining linguistic inquiry word count (liwc) with culture and suicide related lexicons
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9947407/
https://www.ncbi.nlm.nih.gov/pubmed/36846219
http://dx.doi.org/10.3389/fpsyt.2023.1121583
work_keys_str_mv AT lyusihua detectingdepressionofchinesemicroblogusersviatextanalysiscombininglinguisticinquirywordcountliwcwithcultureandsuiciderelatedlexicons
AT renxiaopeng detectingdepressionofchinesemicroblogusersviatextanalysiscombininglinguisticinquirywordcountliwcwithcultureandsuiciderelatedlexicons
AT duyihua detectingdepressionofchinesemicroblogusersviatextanalysiscombininglinguisticinquirywordcountliwcwithcultureandsuiciderelatedlexicons
AT zhaonan detectingdepressionofchinesemicroblogusersviatextanalysiscombininglinguisticinquirywordcountliwcwithcultureandsuiciderelatedlexicons