Cargando…

A web framework for information aggregation and management of multilingual hate speech

Social media platforms have led to the creation of a vast amount of information produced by users and published publicly, facilitating participation in the public sphere, but also giving the opportunity for certain users to publish hateful content. This content mainly involves offensive/discriminati...

Descripción completa

Detalles Bibliográficos
Autores principales: Kotsakis, Rigas, Vrysis, Lazaros, Vryzas, Nikolaos, Saridou, Theodora, Matsiola, Maria, Veglis, Andreas, Dimoulas, Charalampos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196859/
https://www.ncbi.nlm.nih.gov/pubmed/37215824
http://dx.doi.org/10.1016/j.heliyon.2023.e16084
_version_ 1785044434457460736
author Kotsakis, Rigas
Vrysis, Lazaros
Vryzas, Nikolaos
Saridou, Theodora
Matsiola, Maria
Veglis, Andreas
Dimoulas, Charalampos
author_facet Kotsakis, Rigas
Vrysis, Lazaros
Vryzas, Nikolaos
Saridou, Theodora
Matsiola, Maria
Veglis, Andreas
Dimoulas, Charalampos
author_sort Kotsakis, Rigas
collection PubMed
description Social media platforms have led to the creation of a vast amount of information produced by users and published publicly, facilitating participation in the public sphere, but also giving the opportunity for certain users to publish hateful content. This content mainly involves offensive/discriminative speech towards social groups or individuals (based on racial, religious, gender or other characteristics) and could possibly lead into subsequent hate actions/crimes due to persistent escalation. Content management and moderation in big data volumes can no longer be supported manually. In the current research, a web framework is presented and evaluated for the collection, analysis, and aggregation of multilingual textual content from various online sources. The framework is designed to address the needs of human users, journalists, academics, and the public to collect and analyze content from social media and the web in Spanish, Italian, Greek, and English, without prior training or a background in Computer Science. The backend functionality provides content collection and monitoring, semantic analysis including hate speech detection and sentiment analysis using machine learning models and rule-based algorithms, storing, querying, and retrieving such content along with the relevant metadata in a database. This functionality is assessed through a graphic user interface that is accessed using a web browser. An evaluation procedure was held through online questionnaires, including journalists and students, proving the feasibility of the use of the proposed framework by non-experts for the defined use-case scenarios.
format Online
Article
Text
id pubmed-10196859
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-101968592023-05-20 A web framework for information aggregation and management of multilingual hate speech Kotsakis, Rigas Vrysis, Lazaros Vryzas, Nikolaos Saridou, Theodora Matsiola, Maria Veglis, Andreas Dimoulas, Charalampos Heliyon Research Article Social media platforms have led to the creation of a vast amount of information produced by users and published publicly, facilitating participation in the public sphere, but also giving the opportunity for certain users to publish hateful content. This content mainly involves offensive/discriminative speech towards social groups or individuals (based on racial, religious, gender or other characteristics) and could possibly lead into subsequent hate actions/crimes due to persistent escalation. Content management and moderation in big data volumes can no longer be supported manually. In the current research, a web framework is presented and evaluated for the collection, analysis, and aggregation of multilingual textual content from various online sources. The framework is designed to address the needs of human users, journalists, academics, and the public to collect and analyze content from social media and the web in Spanish, Italian, Greek, and English, without prior training or a background in Computer Science. The backend functionality provides content collection and monitoring, semantic analysis including hate speech detection and sentiment analysis using machine learning models and rule-based algorithms, storing, querying, and retrieving such content along with the relevant metadata in a database. This functionality is assessed through a graphic user interface that is accessed using a web browser. An evaluation procedure was held through online questionnaires, including journalists and students, proving the feasibility of the use of the proposed framework by non-experts for the defined use-case scenarios. Elsevier 2023-05-09 /pmc/articles/PMC10196859/ /pubmed/37215824 http://dx.doi.org/10.1016/j.heliyon.2023.e16084 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Kotsakis, Rigas
Vrysis, Lazaros
Vryzas, Nikolaos
Saridou, Theodora
Matsiola, Maria
Veglis, Andreas
Dimoulas, Charalampos
A web framework for information aggregation and management of multilingual hate speech
title A web framework for information aggregation and management of multilingual hate speech
title_full A web framework for information aggregation and management of multilingual hate speech
title_fullStr A web framework for information aggregation and management of multilingual hate speech
title_full_unstemmed A web framework for information aggregation and management of multilingual hate speech
title_short A web framework for information aggregation and management of multilingual hate speech
title_sort web framework for information aggregation and management of multilingual hate speech
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10196859/
https://www.ncbi.nlm.nih.gov/pubmed/37215824
http://dx.doi.org/10.1016/j.heliyon.2023.e16084
work_keys_str_mv AT kotsakisrigas awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT vrysislazaros awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT vryzasnikolaos awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT saridoutheodora awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT matsiolamaria awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT veglisandreas awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT dimoulascharalampos awebframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT kotsakisrigas webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT vrysislazaros webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT vryzasnikolaos webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT saridoutheodora webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT matsiolamaria webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT veglisandreas webframeworkforinformationaggregationandmanagementofmultilingualhatespeech
AT dimoulascharalampos webframeworkforinformationaggregationandmanagementofmultilingualhatespeech