Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks

The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep mo...

Bibliographic Details
Main author: Dyulicheva, Yu. Yu.
Format: Online Article Text
Language: English
Published: Pleiades Publishing 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019794/
http://dx.doi.org/10.1134/S00051179220120025
_version_ 1784908104095236096
author Dyulicheva, Yu. Yu.
author_facet Dyulicheva, Yu. Yu.
author_sort Dyulicheva, Yu. Yu.
collection PubMed
description The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep model BERT of the language representation. The use of a hybrid approach based on text vectorization using BERT and bigrams analysis has made it possible to improve the quality of comments classification up to 91%. Based on a cluster analysis of 1857 English-language comments describing anxiety, clusters were identified using BERT+k-means. The study proposes a hybrid approach based on the use of the LDA topic modeling method, the VADER sentiments analysis method, pointwise mutual information, and parts of speech analysis and permitting one to select bigrams and trigrams to describe clusters of comments. To visualize the extracted patterns in the form of trigrams, a knowledge graph was constructed that describes the subject area, and a comparison of the words of the selected target trigrams with the words of a custom dictionary describing various affective disorders has made it possible to determine the types of psychosocial stressors associated with affective disorders.
format Online
Article
Text
id pubmed-10019794
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Pleiades Publishing
record_format MEDLINE/PubMed
spelling pubmed-100197942023-03-17 Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks Dyulicheva, Yu. Yu. Autom Remote Control Thematic Issue The paper considers the problem of classifying 3553 English-language comments from the social network Reddit based on various approaches to the vectorization of comment texts, including bag of words, TF–IDF, bigrams analysis based on pointwise mutual information (PMI) and sentiments, and the deep model BERT of the language representation. The use of a hybrid approach based on text vectorization using BERT and bigrams analysis has made it possible to improve the quality of comments classification up to 91%. Based on a cluster analysis of 1857 English-language comments describing anxiety, clusters were identified using BERT+k-means. The study proposes a hybrid approach based on the use of the LDA topic modeling method, the VADER sentiments analysis method, pointwise mutual information, and parts of speech analysis and permitting one to select bigrams and trigrams to describe clusters of comments. To visualize the extracted patterns in the form of trigrams, a knowledge graph was constructed that describes the subject area, and a comparison of the words of the selected target trigrams with the words of a custom dictionary describing various affective disorders has made it possible to determine the types of psychosocial stressors associated with affective disorders. Pleiades Publishing 2023-03-16 2022 /pmc/articles/PMC10019794/ http://dx.doi.org/10.1134/S00051179220120025 Text en © Pleiades Publishing, Ltd. 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Thematic Issue
Dyulicheva, Yu. Yu.
Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks
title Identification of Affective States Based on Automatic Analysis of Texts of Comments in Social Networks
title_sort identification of affective states based on automatic analysis of texts of comments in social networks
topic Thematic Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10019794/
http://dx.doi.org/10.1134/S00051179220120025
work_keys_str_mv AT dyulichevayuyu identificationofaffectivestatesbasedonautomaticanalysisoftextsofcommentsinsocialnetworks