Cargando…

The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring

Hate speech has been identified as a pressing problem in society and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during Finnish municipal elections...

Descripción completa

Detalles Bibliográficos
Autores principales: Laaksonen, Salla-Maaria, Haapoja, Jesse, Kinnunen, Teemu, Nelimarkka, Matti, Pöyhtäri, Reeta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931925/
https://www.ncbi.nlm.nih.gov/pubmed/33693378
http://dx.doi.org/10.3389/fdata.2020.00003
_version_ 1783660384349061120
author Laaksonen, Salla-Maaria
Haapoja, Jesse
Kinnunen, Teemu
Nelimarkka, Matti
Pöyhtäri, Reeta
author_facet Laaksonen, Salla-Maaria
Haapoja, Jesse
Kinnunen, Teemu
Nelimarkka, Matti
Pöyhtäri, Reeta
author_sort Laaksonen, Salla-Maaria
collection PubMed
description Hate speech has been identified as a pressing problem in society and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a 2-fold investigation. First, the collaboration offered a unique view for exploring how hate speech emerges as a technical problem. The project developed an adequately well-working algorithmic solution using supervised machine learning. We tested the performance of various feature extraction and machine learning methods and ended up using a combination of Bag-of-Words feature extraction with Support-Vector Machines. However, an automated approach required heavy simplification, such as using rudimentary scales for classifying hate speech and a reliance on word-based approaches, while in reality hate speech is a linguistic and social phenomenon with various tones and forms. Second, the action-research-oriented setting allowed us to observe affective responses, such as the hopes, dreams, and fears related to machine learning technology. Based on participatory observations, project artifacts and documents, interviews with project participants, and online reactions to the detection project, we identified participants' aspirations for effective automation as well as the level of neutrality and objectivity introduced by an algorithmic system. However, the participants expressed more critical views toward the system after the monitoring process. Our findings highlight how the powerful expectations related to technology can easily end up dominating a project dealing with a contested, topical social issue. We conclude by discussing the problematic aspects of datafying hate and suggesting some practical implications for hate speech recognition.
format Online
Article
Text
id pubmed-7931925
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79319252021-03-09 The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring Laaksonen, Salla-Maaria Haapoja, Jesse Kinnunen, Teemu Nelimarkka, Matti Pöyhtäri, Reeta Front Big Data Big Data Hate speech has been identified as a pressing problem in society and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a 2-fold investigation. First, the collaboration offered a unique view for exploring how hate speech emerges as a technical problem. The project developed an adequately well-working algorithmic solution using supervised machine learning. We tested the performance of various feature extraction and machine learning methods and ended up using a combination of Bag-of-Words feature extraction with Support-Vector Machines. However, an automated approach required heavy simplification, such as using rudimentary scales for classifying hate speech and a reliance on word-based approaches, while in reality hate speech is a linguistic and social phenomenon with various tones and forms. Second, the action-research-oriented setting allowed us to observe affective responses, such as the hopes, dreams, and fears related to machine learning technology. Based on participatory observations, project artifacts and documents, interviews with project participants, and online reactions to the detection project, we identified participants' aspirations for effective automation as well as the level of neutrality and objectivity introduced by an algorithmic system. However, the participants expressed more critical views toward the system after the monitoring process. Our findings highlight how the powerful expectations related to technology can easily end up dominating a project dealing with a contested, topical social issue. We conclude by discussing the problematic aspects of datafying hate and suggesting some practical implications for hate speech recognition. Frontiers Media S.A. 2020-02-05 /pmc/articles/PMC7931925/ /pubmed/33693378 http://dx.doi.org/10.3389/fdata.2020.00003 Text en Copyright © 2020 Laaksonen, Haapoja, Kinnunen, Nelimarkka and Pöyhtäri. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Laaksonen, Salla-Maaria
Haapoja, Jesse
Kinnunen, Teemu
Nelimarkka, Matti
Pöyhtäri, Reeta
The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title_full The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title_fullStr The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title_full_unstemmed The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title_short The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring
title_sort datafication of hate: expectations and challenges in automated hate speech monitoring
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931925/
https://www.ncbi.nlm.nih.gov/pubmed/33693378
http://dx.doi.org/10.3389/fdata.2020.00003
work_keys_str_mv AT laaksonensallamaaria thedataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT haapojajesse thedataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT kinnunenteemu thedataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT nelimarkkamatti thedataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT poyhtarireeta thedataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT laaksonensallamaaria dataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT haapojajesse dataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT kinnunenteemu dataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT nelimarkkamatti dataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring
AT poyhtarireeta dataficationofhateexpectationsandchallengesinautomatedhatespeechmonitoring