Cargando…
Hope speech detection in YouTube comments
Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such mod...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Vienna
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263029/ https://www.ncbi.nlm.nih.gov/pubmed/35821874 http://dx.doi.org/10.1007/s13278-022-00901-z |
_version_ | 1784742633005907968 |
---|---|
author | Chakravarthi, Bharathi Raja |
author_facet | Chakravarthi, Bharathi Raja |
author_sort | Chakravarthi, Bharathi Raja |
collection | PubMed |
description | Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such models have been successful in detecting and eradicating negativity. However, an additional study has lately been conducted on the enhancement of free expression through social media. Instead of eliminating ostensibly unpleasant words, we created a multilingual dataset to recognize and encourage positivity in the comments, and we propose a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence. We have experimented with multiple machine learning models, including SVM, logistic regression, K-nearest neighbour, decision tree, logistic neighbours, and we propose new CNN based model. Our proposed model outperformed all others with a macro F1-score of 0.75 for English, 0.62 for Tamil, and 0.67 for Malayalam. |
format | Online Article Text |
id | pubmed-9263029 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer Vienna |
record_format | MEDLINE/PubMed |
spelling | pubmed-92630292022-07-08 Hope speech detection in YouTube comments Chakravarthi, Bharathi Raja Soc Netw Anal Min Original Article Recent work on language technology has tried to recognize abusive language such as those containing hate speech and cyberbullying and enhance offensive language identification to moderate social media platforms. Most of these systems depend on machine learning models using a tagged dataset. Such models have been successful in detecting and eradicating negativity. However, an additional study has lately been conducted on the enhancement of free expression through social media. Instead of eliminating ostensibly unpleasant words, we created a multilingual dataset to recognize and encourage positivity in the comments, and we propose a novel custom deep network architecture, which uses a concatenation of embedding from T5-Sentence. We have experimented with multiple machine learning models, including SVM, logistic regression, K-nearest neighbour, decision tree, logistic neighbours, and we propose new CNN based model. Our proposed model outperformed all others with a macro F1-score of 0.75 for English, 0.62 for Tamil, and 0.67 for Malayalam. Springer Vienna 2022-07-07 2022 /pmc/articles/PMC9263029/ /pubmed/35821874 http://dx.doi.org/10.1007/s13278-022-00901-z Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Chakravarthi, Bharathi Raja Hope speech detection in YouTube comments |
title | Hope speech detection in YouTube comments |
title_full | Hope speech detection in YouTube comments |
title_fullStr | Hope speech detection in YouTube comments |
title_full_unstemmed | Hope speech detection in YouTube comments |
title_short | Hope speech detection in YouTube comments |
title_sort | hope speech detection in youtube comments |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263029/ https://www.ncbi.nlm.nih.gov/pubmed/35821874 http://dx.doi.org/10.1007/s13278-022-00901-z |
work_keys_str_mv | AT chakravarthibharathiraja hopespeechdetectioninyoutubecomments |