Cargando…

Emotionally Informed Hate Speech Detection: A Multi-target Perspective

Hate Speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hat...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chiril, Patricia, Pamungkas, Endang Wahyu, Benamara, Farah, Moriceau, Véronique, Patti, Viviana
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8236572/ https://www.ncbi.nlm.nih.gov/pubmed/34221180 http://dx.doi.org/10.1007/s12559-021-09862-5

_version_	1783714565970722816
author	Chiril, Patricia Pamungkas, Endang Wahyu Benamara, Farah Moriceau, Véronique Patti, Viviana
author_facet	Chiril, Patricia Pamungkas, Endang Wahyu Benamara, Farah Moriceau, Véronique Patti, Viviana
author_sort	Chiril, Patricia
collection	PubMed
description	Hate Speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem into a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets, to investigate the problem of transferring knowledge from different datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models including multitask approaches. Our study shows that: (1) training a model on a combination of several (training sets from several) topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions or affective features based on Hurtlex, obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we prove that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection.
format	Online Article Text
id	pubmed-8236572
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-82365722021-06-28 Emotionally Informed Hate Speech Detection: A Multi-target Perspective Chiril, Patricia Pamungkas, Endang Wahyu Benamara, Farah Moriceau, Véronique Patti, Viviana Cognit Comput Article Hate Speech and harassment are widespread in online communication, due to users' freedom and anonymity and the lack of regulation provided by social media platforms. Hate speech is topically focused (misogyny, sexism, racism, xenophobia, homophobia, etc.), and each specific manifestation of hate speech targets different vulnerable groups based on characteristics such as gender (misogyny, sexism), ethnicity, race, religion (xenophobia, racism, Islamophobia), sexual orientation (homophobia), and so on. Most automatic hate speech detection approaches cast the problem into a binary classification task without addressing either the topical focus or the target-oriented nature of hate speech. In this paper, we propose to tackle, for the first time, hate speech detection from a multi-target perspective. We leverage manually annotated datasets, to investigate the problem of transferring knowledge from different datasets with different topical focuses and targets. Our contribution is threefold: (1) we explore the ability of hate speech detection models to capture common properties from topic-generic datasets and transfer this knowledge to recognize specific manifestations of hate speech; (2) we experiment with the development of models to detect both topics (racism, xenophobia, sexism, misogyny) and hate speech targets, going beyond standard binary classification, to investigate how to detect hate speech at a finer level of granularity and how to transfer knowledge across different topics and targets; and (3) we study the impact of affective knowledge encoded in sentic computing resources (SenticNet, EmoSenticNet) and in semantically structured hate lexicons (HurtLex) in determining specific manifestations of hate speech. We experimented with different neural models including multitask approaches. Our study shows that: (1) training a model on a combination of several (training sets from several) topic-specific datasets is more effective than training a model on a topic-generic dataset; (2) the multi-task approach outperforms a single-task model when detecting both the hatefulness of a tweet and its topical focus in the context of a multi-label classification approach; and (3) the models incorporating EmoSenticNet emotions, the first level emotions of SenticNet, a blend of SenticNet and EmoSenticNet emotions or affective features based on Hurtlex, obtained the best results. Our results demonstrate that multi-target hate speech detection from existing datasets is feasible, which is a first step towards hate speech detection for a specific topic/target when dedicated annotated data are missing. Moreover, we prove that domain-independent affective knowledge, injected into our models, helps finer-grained hate speech detection. Springer US 2021-06-28 2022 /pmc/articles/PMC8236572/ /pubmed/34221180 http://dx.doi.org/10.1007/s12559-021-09862-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Chiril, Patricia Pamungkas, Endang Wahyu Benamara, Farah Moriceau, Véronique Patti, Viviana Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title	Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title_full	Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title_fullStr	Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title_full_unstemmed	Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title_short	Emotionally Informed Hate Speech Detection: A Multi-target Perspective
title_sort	emotionally informed hate speech detection: a multi-target perspective
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8236572/ https://www.ncbi.nlm.nih.gov/pubmed/34221180 http://dx.doi.org/10.1007/s12559-021-09862-5
work_keys_str_mv	AT chirilpatricia emotionallyinformedhatespeechdetectionamultitargetperspective AT pamungkasendangwahyu emotionallyinformedhatespeechdetectionamultitargetperspective AT benamarafarah emotionallyinformedhatespeechdetectionamultitargetperspective AT moriceauveronique emotionallyinformedhatespeechdetectionamultitargetperspective AT pattiviviana emotionallyinformedhatespeechdetectionamultitargetperspective

Emotionally Informed Hate Speech Detection: A Multi-target Perspective

Ejemplares similares