Cargando…

Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset

Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this...

Descripción completa

Detalles Bibliográficos
Autores principales: Özgüven, Yavuz Melih, Eken, Süleyman
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190525/
https://www.ncbi.nlm.nih.gov/pubmed/34127932
http://dx.doi.org/10.1007/s12652-021-03328-0
_version_ 1783705702540247040
author Özgüven, Yavuz Melih
Eken, Süleyman
author_facet Özgüven, Yavuz Melih
Eken, Süleyman
author_sort Özgüven, Yavuz Melih
collection PubMed
description Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this paper, we present a distributed and light streaming system for combating pandemics and give a case study on spatial analysis of the COVID-19 geo-tagged Twitter dataset. In this system, three of the major components are the translation of tweets matching with user-defined bounding boxes, name entity recognition in tweets, and skyline queries. Apache Pulsar addresses all these components in this paper. With the proposed system, end-users have the capability of getting COVID-19 related information within foreign regions, filtering/searching location, organization, person, and miscellaneous based tweets, and performing skyline based queries. The evaluation of the proposed system is done based on certain characteristics and performance metrics. The study differs greatly from other studies in terms of using distributed computing and big data technologies on spatial data to combat COVID-19. It is concluded that Pulsar is designed to handle large amounts of long-term on disk persistence.
format Online
Article
Text
id pubmed-8190525
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-81905252021-06-10 Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset Özgüven, Yavuz Melih Eken, Süleyman J Ambient Intell Humaniz Comput Original Research Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this paper, we present a distributed and light streaming system for combating pandemics and give a case study on spatial analysis of the COVID-19 geo-tagged Twitter dataset. In this system, three of the major components are the translation of tweets matching with user-defined bounding boxes, name entity recognition in tweets, and skyline queries. Apache Pulsar addresses all these components in this paper. With the proposed system, end-users have the capability of getting COVID-19 related information within foreign regions, filtering/searching location, organization, person, and miscellaneous based tweets, and performing skyline based queries. The evaluation of the proposed system is done based on certain characteristics and performance metrics. The study differs greatly from other studies in terms of using distributed computing and big data technologies on spatial data to combat COVID-19. It is concluded that Pulsar is designed to handle large amounts of long-term on disk persistence. Springer Berlin Heidelberg 2021-06-10 2023 /pmc/articles/PMC8190525/ /pubmed/34127932 http://dx.doi.org/10.1007/s12652-021-03328-0 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Özgüven, Yavuz Melih
Eken, Süleyman
Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title_full Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title_fullStr Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title_full_unstemmed Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title_short Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
title_sort distributed messaging and light streaming system for combating pandemics: a case study on spatial analysis of covid-19 geo-tagged twitter dataset
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190525/
https://www.ncbi.nlm.nih.gov/pubmed/34127932
http://dx.doi.org/10.1007/s12652-021-03328-0
work_keys_str_mv AT ozguvenyavuzmelih distributedmessagingandlightstreamingsystemforcombatingpandemicsacasestudyonspatialanalysisofcovid19geotaggedtwitterdataset
AT ekensuleyman distributedmessagingandlightstreamingsystemforcombatingpandemicsacasestudyonspatialanalysisofcovid19geotaggedtwitterdataset