Cargando…

Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing

BACKGROUND: Online media play an important role in public health emergencies and serve as essential communication platforms. Infoveillance of online media during the COVID-19 pandemic is an important step toward gaining a better understanding of crisis communication. OBJECTIVE: The goal of this stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Beliga, Slobodan, Martinčić-Ipšić, Sanda, Matešić, Mihaela, Petrijevčanin Vuksanović, Irena, Meštrović, Ana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8715984/
https://www.ncbi.nlm.nih.gov/pubmed/34739388
http://dx.doi.org/10.2196/31540
_version_ 1784624227915137024
author Beliga, Slobodan
Martinčić-Ipšić, Sanda
Matešić, Mihaela
Petrijevčanin Vuksanović, Irena
Meštrović, Ana
author_facet Beliga, Slobodan
Martinčić-Ipšić, Sanda
Matešić, Mihaela
Petrijevčanin Vuksanović, Irena
Meštrović, Ana
author_sort Beliga, Slobodan
collection PubMed
description BACKGROUND: Online media play an important role in public health emergencies and serve as essential communication platforms. Infoveillance of online media during the COVID-19 pandemic is an important step toward gaining a better understanding of crisis communication. OBJECTIVE: The goal of this study was to perform a longitudinal analysis of the COVID-19–related content on online media based on natural language processing. METHODS: We collected a data set of news articles published by Croatian online media during the first 13 months of the pandemic. First, we tested the correlations between the number of articles and the number of new daily COVID-19 cases. Second, we analyzed the content by extracting the most frequent terms and applied the Jaccard similarity coefficient. Third, we compared the occurrence of the pandemic-related terms during the two waves of the pandemic. Finally, we applied named entity recognition to extract the most frequent entities and tracked the dynamics of changes during the observation period. RESULTS: The results showed no significant correlation between the number of articles and the number of new daily COVID-19 cases. Furthermore, there were high overlaps in the terminology used in all articles published during the pandemic with a slight shift in the pandemic-related terms between the first and the second waves. Finally, the findings indicate that the most influential entities have lower overlaps for the identified people and higher overlaps for locations and institutions. CONCLUSIONS: Our study shows that online media have a prompt response to the pandemic with a large number of COVID-19–related articles. There was a high overlap in the frequently used terms across the first 13 months, which may indicate the narrow focus of reporting in certain periods. However, the pandemic-related terminology is well-covered.
format Online
Article
Text
id pubmed-8715984
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-87159842022-01-14 Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing Beliga, Slobodan Martinčić-Ipšić, Sanda Matešić, Mihaela Petrijevčanin Vuksanović, Irena Meštrović, Ana JMIR Public Health Surveill Original Paper BACKGROUND: Online media play an important role in public health emergencies and serve as essential communication platforms. Infoveillance of online media during the COVID-19 pandemic is an important step toward gaining a better understanding of crisis communication. OBJECTIVE: The goal of this study was to perform a longitudinal analysis of the COVID-19–related content on online media based on natural language processing. METHODS: We collected a data set of news articles published by Croatian online media during the first 13 months of the pandemic. First, we tested the correlations between the number of articles and the number of new daily COVID-19 cases. Second, we analyzed the content by extracting the most frequent terms and applied the Jaccard similarity coefficient. Third, we compared the occurrence of the pandemic-related terms during the two waves of the pandemic. Finally, we applied named entity recognition to extract the most frequent entities and tracked the dynamics of changes during the observation period. RESULTS: The results showed no significant correlation between the number of articles and the number of new daily COVID-19 cases. Furthermore, there were high overlaps in the terminology used in all articles published during the pandemic with a slight shift in the pandemic-related terms between the first and the second waves. Finally, the findings indicate that the most influential entities have lower overlaps for the identified people and higher overlaps for locations and institutions. CONCLUSIONS: Our study shows that online media have a prompt response to the pandemic with a large number of COVID-19–related articles. There was a high overlap in the frequently used terms across the first 13 months, which may indicate the narrow focus of reporting in certain periods. However, the pandemic-related terminology is well-covered. JMIR Publications 2021-12-24 /pmc/articles/PMC8715984/ /pubmed/34739388 http://dx.doi.org/10.2196/31540 Text en ©Slobodan Beliga, Sanda Martinčić-Ipšić, Mihaela Matešić, Irena Petrijevčanin Vuksanović, Ana Meštrović. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 24.12.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Beliga, Slobodan
Martinčić-Ipšić, Sanda
Matešić, Mihaela
Petrijevčanin Vuksanović, Irena
Meštrović, Ana
Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title_full Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title_fullStr Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title_full_unstemmed Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title_short Infoveillance of the Croatian Online Media During the COVID-19 Pandemic: One-Year Longitudinal Study Using Natural Language Processing
title_sort infoveillance of the croatian online media during the covid-19 pandemic: one-year longitudinal study using natural language processing
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8715984/
https://www.ncbi.nlm.nih.gov/pubmed/34739388
http://dx.doi.org/10.2196/31540
work_keys_str_mv AT beligaslobodan infoveillanceofthecroatianonlinemediaduringthecovid19pandemiconeyearlongitudinalstudyusingnaturallanguageprocessing
AT martincicipsicsanda infoveillanceofthecroatianonlinemediaduringthecovid19pandemiconeyearlongitudinalstudyusingnaturallanguageprocessing
AT matesicmihaela infoveillanceofthecroatianonlinemediaduringthecovid19pandemiconeyearlongitudinalstudyusingnaturallanguageprocessing
AT petrijevcaninvuksanovicirena infoveillanceofthecroatianonlinemediaduringthecovid19pandemiconeyearlongitudinalstudyusingnaturallanguageprocessing
AT mestrovicana infoveillanceofthecroatianonlinemediaduringthecovid19pandemiconeyearlongitudinalstudyusingnaturallanguageprocessing