Cargando…
Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks
The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep mo...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Pleiades Publishing
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019794/ http://dx.doi.org/10.1134/S00051179220120025 |
_version_ | 1784908104095236096 |
---|---|
author | Dyulicheva, Yu. Yu. |
author_facet | Dyulicheva, Yu. Yu. |
author_sort | Dyulicheva, Yu. Yu. |
collection | PubMed |
description | The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep model BERT of the language representation. The use of a hybrid approach based on text vectorization using BERT and bigrams analysis have made it possible to improve the quality of comments classification up to 91%. Based on a cluster analysis of 1857 English-language comments describing anxiety, clusters were identified using BERT+k-means. The study proposes a hybrid approach based on the use of the LDA topic modeling method, the VADER sentiments analysis method, pointwise mutual information, and parts of speech analysis and permitting one to select bigrams and trigrams to describe clusters of comments. To visualize the extracted patterns in the form of trigrams, a knowledge graph was constructed that describes the subject area, and a comparison of the words of the selected target trigrams with the words of a custom dictionary describing various affective disorders has made it possible to determine the types of psychosocial stressors associated with affective disorders. |
format | Online Article Text |
id | pubmed-10019794 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Pleiades Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-100197942023-03-17 Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks Dyulicheva, Yu. Yu. Autom Remote Control Thematic Issue The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep model BERT of the language representation. The use of a hybrid approach based on text vectorization using BERT and bigrams analysis have made it possible to improve the quality of comments classification up to 91%. Based on a cluster analysis of 1857 English-language comments describing anxiety, clusters were identified using BERT+k-means. The study proposes a hybrid approach based on the use of the LDA topic modeling method, the VADER sentiments analysis method, pointwise mutual information, and parts of speech analysis and permitting one to select bigrams and trigrams to describe clusters of comments. To visualize the extracted patterns in the form of trigrams, a knowledge graph was constructed that describes the subject area, and a comparison of the words of the selected target trigrams with the words of a custom dictionary describing various affective disorders has made it possible to determine the types of psychosocial stressors associated with affective disorders. Pleiades Publishing 2023-03-16 2022 /pmc/articles/PMC10019794/ http://dx.doi.org/10.1134/S00051179220120025 Text en © Pleiades Publishing, Ltd. 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Thematic Issue Dyulicheva, Yu. Yu. Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title | Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title_full | Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title_fullStr | Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title_full_unstemmed | Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title_short | Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks |
title_sort | identification of affective states based on automatic analysis of texts of comments in social networks |
topic | Thematic Issue |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019794/ http://dx.doi.org/10.1134/S00051179220120025 |
work_keys_str_mv | AT dyulichevayuyu identificationofaffectivestatesbasedonautomaticanalysisoftextsofcommentsinsocialnetworks |