Cargando…
Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset
Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190525/ https://www.ncbi.nlm.nih.gov/pubmed/34127932 http://dx.doi.org/10.1007/s12652-021-03328-0 |
_version_ | 1783705702540247040 |
---|---|
author | Özgüven, Yavuz Melih Eken, Süleyman |
author_facet | Özgüven, Yavuz Melih Eken, Süleyman |
author_sort | Özgüven, Yavuz Melih |
collection | PubMed |
description | Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this paper, we present a distributed and light streaming system for combating pandemics and give a case study on spatial analysis of the COVID-19 geo-tagged Twitter dataset. In this system, three of the major components are the translation of tweets matching with user-defined bounding boxes, name entity recognition in tweets, and skyline queries. Apache Pulsar addresses all these components in this paper. With the proposed system, end-users have the capability of getting COVID-19 related information within foreign regions, filtering/searching location, organization, person, and miscellaneous based tweets, and performing skyline based queries. The evaluation of the proposed system is done based on certain characteristics and performance metrics. The study differs greatly from other studies in terms of using distributed computing and big data technologies on spatial data to combat COVID-19. It is concluded that Pulsar is designed to handle large amounts of long-term on disk persistence. |
format | Online Article Text |
id | pubmed-8190525 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-81905252021-06-10 Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset Özgüven, Yavuz Melih Eken, Süleyman J Ambient Intell Humaniz Comput Original Research Real-time data processing and distributed messaging are problems that have been worked on for a long time. As the amount of spatial data being produced has increased, coupled with increasingly complex software solutions being developed, there is a need for platforms that address these needs. In this paper, we present a distributed and light streaming system for combating pandemics and give a case study on spatial analysis of the COVID-19 geo-tagged Twitter dataset. In this system, three of the major components are the translation of tweets matching with user-defined bounding boxes, name entity recognition in tweets, and skyline queries. Apache Pulsar addresses all these components in this paper. With the proposed system, end-users have the capability of getting COVID-19 related information within foreign regions, filtering/searching location, organization, person, and miscellaneous based tweets, and performing skyline based queries. The evaluation of the proposed system is done based on certain characteristics and performance metrics. The study differs greatly from other studies in terms of using distributed computing and big data technologies on spatial data to combat COVID-19. It is concluded that Pulsar is designed to handle large amounts of long-term on disk persistence. Springer Berlin Heidelberg 2021-06-10 2023 /pmc/articles/PMC8190525/ /pubmed/34127932 http://dx.doi.org/10.1007/s12652-021-03328-0 Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Research Özgüven, Yavuz Melih Eken, Süleyman Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title | Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title_full | Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title_fullStr | Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title_full_unstemmed | Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title_short | Distributed messaging and light streaming system for combating pandemics: A case study on spatial analysis of COVID-19 Geo-tagged Twitter dataset |
title_sort | distributed messaging and light streaming system for combating pandemics: a case study on spatial analysis of covid-19 geo-tagged twitter dataset |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8190525/ https://www.ncbi.nlm.nih.gov/pubmed/34127932 http://dx.doi.org/10.1007/s12652-021-03328-0 |
work_keys_str_mv | AT ozguvenyavuzmelih distributedmessagingandlightstreamingsystemforcombatingpandemicsacasestudyonspatialanalysisofcovid19geotaggedtwitterdataset AT ekensuleyman distributedmessagingandlightstreamingsystemforcombatingpandemicsacasestudyonspatialanalysisofcovid19geotaggedtwitterdataset |