Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content

A new method for the recognition of meaningful changes in social state based on transformations of the linguistic content in Arabic newspapers is suggested. The detected alterations of the linguistic material in Arabic newspapers play an indicator role. The currently proposed approach acts in an “on...

Bibliographic Details
Main authors: Bernikova, Olga, Granichin, Oleg, Lemberg, Dan, Redkin, Oleg, Volkovich, Zeev
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516919/
https://www.ncbi.nlm.nih.gov/pubmed/33286215
http://dx.doi.org/10.3390/e22040441
_version_ 1783587109507956736
author Bernikova, Olga
Granichin, Oleg
Lemberg, Dan
Redkin, Oleg
Volkovich, Zeev
author_facet Bernikova, Olga
Granichin, Oleg
Lemberg, Dan
Redkin, Oleg
Volkovich, Zeev
author_sort Bernikova, Olga
collection PubMed
description A new method for the recognition of meaningful changes in social state based on transformations of the linguistic content in Arabic newspapers is suggested. The detected alterations of the linguistic material in Arabic newspapers play an indicator role. The currently proposed approach acts in an “online” fashion and uses pre-trained vector representations of Arabic words. After a pre-processing stage, the words in the issues’ texts are substituted by vectors obtained within a word embedding methodology. The approach typifies the consistent linguistic template by the similarity of the embedded vectors. A change in the distributions of the issue-grounded samples indicates a difference in the underlying newspaper template. A two-step procedure implements the concept, where the first step compares the similarity distribution of the current issue versus the union of ones corresponding to several of its predecessors. A repeating under-sampling approach accompanied by a two-sample test stabilizes the sampling and returns a collection of the resultant p-values. In the second stage, the entropy of these sets is sequentially calculated, such that the change points of the time series obtained in this way indicate the changes in the newspaper content. Numerical experiments provided on the following issues of several Arabic newspapers published in the Arab Spring period demonstrate the high reliability of the method.
format Online
Article
Text
id pubmed-7516919
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75169192020-11-09 Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content Bernikova, Olga Granichin, Oleg Lemberg, Dan Redkin, Oleg Volkovich, Zeev Entropy (Basel) Article A new method for the recognition of meaningful changes in social state based on transformations of the linguistic content in Arabic newspapers is suggested. The detected alterations of the linguistic material in Arabic newspapers play an indicator role. The currently proposed approach acts in an “online” fashion and uses pre-trained vector representations of Arabic words. After a pre-processing stage, the words in the issues’ texts are substituted by vectors obtained within a word embedding methodology. The approach typifies the consistent linguistic template by the similarity of the embedded vectors. A change in the distributions of the issue-grounded samples indicates a difference in the underlying newspaper template. A two-step procedure implements the concept, where the first step compares the similarity distribution of the current issue versus the union of ones corresponding to several of its predecessors. A repeating under-sampling approach accompanied by a two-sample test stabilizes the sampling and returns a collection of the resultant p-values. In the second stage, the entropy of these sets is sequentially calculated, such that the change points of the time series obtained in this way indicate the changes in the newspaper content. Numerical experiments provided on the following issues of several Arabic newspapers published in the Arab Spring period demonstrate the high reliability of the method. MDPI 2020-04-14 /pmc/articles/PMC7516919/ /pubmed/33286215 http://dx.doi.org/10.3390/e22040441 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bernikova, Olga
Granichin, Oleg
Lemberg, Dan
Redkin, Oleg
Volkovich, Zeev
Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title_full Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title_fullStr Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title_full_unstemmed Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title_short Entropy-Based Approach for the Detection of Changes in Arabic Newspapers’ Content
title_sort entropy-based approach for the detection of changes in arabic newspapers’ content
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516919/
https://www.ncbi.nlm.nih.gov/pubmed/33286215
http://dx.doi.org/10.3390/e22040441
work_keys_str_mv AT bernikovaolga entropybasedapproachforthedetectionofchangesinarabicnewspaperscontent
AT granichinoleg entropybasedapproachforthedetectionofchangesinarabicnewspaperscontent
AT lembergdan entropybasedapproachforthedetectionofchangesinarabicnewspaperscontent
AT redkinoleg entropybasedapproachforthedetectionofchangesinarabicnewspaperscontent
AT volkovichzeev entropybasedapproachforthedetectionofchangesinarabicnewspaperscontent
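
The two-step procedure summarized in the description field can be illustrated with a small sketch. This is not the authors' implementation: the record names only "a two-sample test", so the Kolmogorov-Smirnov test (with the standard asymptotic p-value approximation) is an assumption, as are the histogram-based Shannon entropy, the function names, the sample sizes, and the synthetic similarity scores that stand in for similarities of embedded Arabic word vectors.

```python
import math
import random
from bisect import bisect_right


def ks_two_sample_pvalue(x, y):
    """Asymptotic p-value of a two-sample Kolmogorov-Smirnov test (assumed test)."""
    xs, ys = sorted(x), sorted(y)
    n, m = len(xs), len(ys)
    # Largest gap between the two empirical CDFs, evaluated at every data point.
    d = max(abs(bisect_right(xs, v) / n - bisect_right(ys, v) / m) for v in xs + ys)
    ne = n * m / (n + m)
    lam = (math.sqrt(ne) + 0.12 + 0.11 / math.sqrt(ne)) * d
    if lam < 0.2:
        return 1.0  # the Kolmogorov tail probability is numerically 1 here
    tail = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * (k * lam) ** 2)
                     for k in range(1, 101))
    return min(1.0, max(0.0, tail))


def undersampled_pvalues(current, reference, size=50, repeats=100, rng=None):
    """Step 1: repeatedly under-sample both sets and collect the test p-values."""
    rng = rng or random.Random(0)
    return [ks_two_sample_pvalue(rng.sample(current, size),
                                 rng.sample(reference, size))
            for _ in range(repeats)]


def pvalue_entropy(pvalues, bins=10):
    """Step 2: Shannon entropy (in nats) of the p-values binned over [0, 1]."""
    counts = [0] * bins
    for p in pvalues:
        counts[min(int(p * bins), bins - 1)] += 1
    total = len(pvalues)
    return -sum(c / total * math.log(c / total) for c in counts if c)


# Synthetic similarity scores (hypothetical data, not taken from the paper).
data_rng = random.Random(42)
reference = [data_rng.gauss(0.5, 0.1) for _ in range(900)]      # pooled predecessors
issue_same = [data_rng.gauss(0.5, 0.1) for _ in range(300)]     # unchanged template
issue_changed = [data_rng.gauss(0.7, 0.1) for _ in range(300)]  # shifted template

p_same = undersampled_pvalues(issue_same, reference, rng=random.Random(1))
p_changed = undersampled_pvalues(issue_changed, reference, rng=random.Random(2))
entropy_same = pvalue_entropy(p_same)
entropy_changed = pvalue_entropy(p_changed)
print(f"entropy, unchanged issue: {entropy_same:.3f}")
print(f"entropy, changed issue:   {entropy_changed:.3f}")
```

In this sketch, an issue drawn from the same distribution as its predecessors yields p-values spread across the unit interval, while a shifted issue concentrates them near zero, which lowers the entropy. Computing this entropy issue by issue gives the time series whose change points the description refers to; the change-point detection itself is not shown here.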