Cargando…

Linguistic Patterns for Code Word Resilient Hate Speech Identification

The permanent transition to online activity has brought with it a surge in hate speech discourse. This has prompted increased calls for automatic detection methods, most of which currently rely on a dictionary of hate speech words, and supervised classification. This approach often falls short when...

Descripción completa

Detalles Bibliográficos
Autores principales: Calderón, Fernando H., Balani, Namrita, Taylor, Jherez, Peignon, Melvyn, Huang, Yen-Hao, Chen, Yi-Shin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659976/
https://www.ncbi.nlm.nih.gov/pubmed/34883861
http://dx.doi.org/10.3390/s21237859
_version_ 1784613091719249920
author Calderón, Fernando H.
Balani, Namrita
Taylor, Jherez
Peignon, Melvyn
Huang, Yen-Hao
Chen, Yi-Shin
author_facet Calderón, Fernando H.
Balani, Namrita
Taylor, Jherez
Peignon, Melvyn
Huang, Yen-Hao
Chen, Yi-Shin
author_sort Calderón, Fernando H.
collection PubMed
description The permanent transition to online activity has brought with it a surge in hate speech discourse. This has prompted increased calls for automatic detection methods, most of which currently rely on a dictionary of hate speech words, and supervised classification. This approach often falls short when dealing with newer words and phrases produced by online extremist communities. These code words are used with the aim of evading automatic detection by systems. Code words are frequently used and have benign meanings in regular discourse, for instance, “skypes, googles, bing, yahoos” are all examples of words that have a hidden hate speech meaning. Such overlap presents a challenge to the traditional keyword approach of collecting data that is specific to hate speech. In this work, we first introduced a word embedding model that learns the hidden hate speech meaning of words. With this insight on code words, we developed a classifier that leverages linguistic patterns to reduce the impact of individual words. The proposed method was evaluated across three different datasets to test its generalizability. The empirical results show that the linguistic patterns approach outperforms the baselines and enables further analysis on hate speech expressions.
format Online
Article
Text
id pubmed-8659976
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86599762021-12-10 Linguistic Patterns for Code Word Resilient Hate Speech Identification Calderón, Fernando H. Balani, Namrita Taylor, Jherez Peignon, Melvyn Huang, Yen-Hao Chen, Yi-Shin Sensors (Basel) Article The permanent transition to online activity has brought with it a surge in hate speech discourse. This has prompted increased calls for automatic detection methods, most of which currently rely on a dictionary of hate speech words, and supervised classification. This approach often falls short when dealing with newer words and phrases produced by online extremist communities. These code words are used with the aim of evading automatic detection by systems. Code words are frequently used and have benign meanings in regular discourse, for instance, “skypes, googles, bing, yahoos” are all examples of words that have a hidden hate speech meaning. Such overlap presents a challenge to the traditional keyword approach of collecting data that is specific to hate speech. In this work, we first introduced a word embedding model that learns the hidden hate speech meaning of words. With this insight on code words, we developed a classifier that leverages linguistic patterns to reduce the impact of individual words. The proposed method was evaluated across three different datasets to test its generalizability. The empirical results show that the linguistic patterns approach outperforms the baselines and enables further analysis on hate speech expressions. MDPI 2021-11-25 /pmc/articles/PMC8659976/ /pubmed/34883861 http://dx.doi.org/10.3390/s21237859 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Calderón, Fernando H.
Balani, Namrita
Taylor, Jherez
Peignon, Melvyn
Huang, Yen-Hao
Chen, Yi-Shin
Linguistic Patterns for Code Word Resilient Hate Speech Identification
title Linguistic Patterns for Code Word Resilient Hate Speech Identification
title_full Linguistic Patterns for Code Word Resilient Hate Speech Identification
title_fullStr Linguistic Patterns for Code Word Resilient Hate Speech Identification
title_full_unstemmed Linguistic Patterns for Code Word Resilient Hate Speech Identification
title_short Linguistic Patterns for Code Word Resilient Hate Speech Identification
title_sort linguistic patterns for code word resilient hate speech identification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8659976/
https://www.ncbi.nlm.nih.gov/pubmed/34883861
http://dx.doi.org/10.3390/s21237859
work_keys_str_mv AT calderonfernandoh linguisticpatternsforcodewordresilienthatespeechidentification
AT balaninamrita linguisticpatternsforcodewordresilienthatespeechidentification
AT taylorjherez linguisticpatternsforcodewordresilienthatespeechidentification
AT peignonmelvyn linguisticpatternsforcodewordresilienthatespeechidentification
AT huangyenhao linguisticpatternsforcodewordresilienthatespeechidentification
AT chenyishin linguisticpatternsforcodewordresilienthatespeechidentification