Cargando…

Towards generalisable hate speech detection: a review on obstacles and solutions

Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Wenjie, Zubiaga, Arkaitz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8237316/
https://www.ncbi.nlm.nih.gov/pubmed/34239978
http://dx.doi.org/10.7717/peerj-cs.598
_version_ 1783714706555404288
author Yin, Wenjie
Zubiaga, Arkaitz
author_facet Yin, Wenjie
Zubiaga, Arkaitz
author_sort Yin, Wenjie
collection PubMed
description Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.
format Online
Article
Text
id pubmed-8237316
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-82373162021-07-07 Towards generalisable hate speech detection: a review on obstacles and solutions Yin, Wenjie Zubiaga, Arkaitz PeerJ Comput Sci Artificial Intelligence Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection. PeerJ Inc. 2021-06-17 /pmc/articles/PMC8237316/ /pubmed/34239978 http://dx.doi.org/10.7717/peerj-cs.598 Text en ©2021 Yin and Zubiaga https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Yin, Wenjie
Zubiaga, Arkaitz
Towards generalisable hate speech detection: a review on obstacles and solutions
title Towards generalisable hate speech detection: a review on obstacles and solutions
title_full Towards generalisable hate speech detection: a review on obstacles and solutions
title_fullStr Towards generalisable hate speech detection: a review on obstacles and solutions
title_full_unstemmed Towards generalisable hate speech detection: a review on obstacles and solutions
title_short Towards generalisable hate speech detection: a review on obstacles and solutions
title_sort towards generalisable hate speech detection: a review on obstacles and solutions
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8237316/
https://www.ncbi.nlm.nih.gov/pubmed/34239978
http://dx.doi.org/10.7717/peerj-cs.598
work_keys_str_mv AT yinwenjie towardsgeneralisablehatespeechdetectionareviewonobstaclesandsolutions
AT zubiagaarkaitz towardsgeneralisablehatespeechdetectionareviewonobstaclesandsolutions