Cargando…

Hope speech detection in YouTube comments

Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such mod...

Descripción completa

Detalles Bibliográficos
Autor principal: Chakravarthi, Bharathi Raja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Vienna 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263029/
https://www.ncbi.nlm.nih.gov/pubmed/35821874
http://dx.doi.org/10.1007/s13278-022-00901-z
_version_ 1784742633005907968
author Chakravarthi, Bharathi Raja
author_facet Chakravarthi, Bharathi Raja
author_sort Chakravarthi, Bharathi Raja
collection PubMed
description Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such models have been successful in detecting and eradicating negativity. However, an additional study has lately been conducted on the enhancement of free expression through social media. Instead of eliminating ostensibly unpleasant words, we created a multilingual dataset to recognize and encourage positivity in the comments, and we propose a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence. We have experimented with multiple machine learning models, including SVM, logistic regression, K-nearest neighbour, decision tree, logistic neighbours, and we propose new CNN based model. Our proposed model outperformed all others with a macro F1-score of 0.75 for English, 0.62 for Tamil, and 0.67 for Malayalam.
format Online
Article
Text
id pubmed-9263029
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer Vienna
record_format MEDLINE/PubMed
spelling pubmed-92630292022-07-08 Hope speech detection in YouTube comments Chakravarthi, Bharathi Raja Soc Netw Anal Min Original Article Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such models have been successful in detecting and eradicating negativity. However, an additional study has lately been conducted on the enhancement of free expression through social media. Instead of eliminating ostensibly unpleasant words, we created a multilingual dataset to recognize and encourage positivity in the comments, and we propose a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence. We have experimented with multiple machine learning models, including SVM, logistic regression, K-nearest neighbour, decision tree, logistic neighbours, and we propose new CNN based model. Our proposed model outperformed all others with a macro F1-score of 0.75 for English, 0.62 for Tamil, and 0.67 for Malayalam. Springer Vienna 2022-07-07 2022 /pmc/articles/PMC9263029/ /pubmed/35821874 http://dx.doi.org/10.1007/s13278-022-00901-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Article
Chakravarthi, Bharathi Raja
Hope speech detection in YouTube comments
title Hope speech detection in YouTube comments
title_full Hope speech detection in YouTube comments
title_fullStr Hope speech detection in YouTube comments
title_full_unstemmed Hope speech detection in YouTube comments
title_short Hope speech detection in YouTube comments
title_sort hope speech detection in youtube comments
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263029/
https://www.ncbi.nlm.nih.gov/pubmed/35821874
http://dx.doi.org/10.1007/s13278-022-00901-z
work_keys_str_mv AT chakravarthibharathiraja hopespeechdetectioninyoutubecomments