Cargando…

Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database

Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data proven...

Descripción completa

Detalles Bibliográficos
Autores principales: Rani, Asma, Goyal, Navneet, Gadia, Shashi K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8576806/
https://www.ncbi.nlm.nih.gov/pubmed/34778513
http://dx.doi.org/10.1007/s41060-021-00287-9
_version_ 1784595954304811008
author Rani, Asma
Goyal, Navneet
Gadia, Shashi K.
author_facet Rani, Asma
Goyal, Navneet
Gadia, Shashi K.
author_sort Rani, Asma
collection PubMed
description Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data provenance of digital content. Social Data Provenance describes the origin, derivation process, and transformations of social content throughout its lifecycle. In this paper, we present a Big Social Data Provenance (BSDP) Framework for key-value pair (KVP) database using the novel concept of Zero-Information Loss Database (ZILD). In our proposed framework, a huge volume of social data is first fetched from the social media (Twitter’s Network) through live streaming and simultaneously modelled in a KVP database by using a query-driven approach. The proposed framework is capable in capturing, storing, and querying provenance information for different query sets including select, aggregate, standing/historical, and data update (i.e., insert, delete, update) queries on Big Social Data. We evaluate the performance of proposed framework in terms of provenance capturing overhead for different query sets including select, aggregate, and data update queries, and average execution time for various provenance queries.
format Online
Article
Text
id pubmed-8576806
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-85768062021-11-09 Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database Rani, Asma Goyal, Navneet Gadia, Shashi K. Int J Data Sci Anal Regular Paper Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data provenance of digital content. Social Data Provenance describes the origin, derivation process, and transformations of social content throughout its lifecycle. In this paper, we present a Big Social Data Provenance (BSDP) Framework for key-value pair (KVP) database using the novel concept of Zero-Information Loss Database (ZILD). In our proposed framework, a huge volume of social data is first fetched from the social media (Twitter’s Network) through live streaming and simultaneously modelled in a KVP database by using a query-driven approach. The proposed framework is capable in capturing, storing, and querying provenance information for different query sets including select, aggregate, standing/historical, and data update (i.e., insert, delete, update) queries on Big Social Data. We evaluate the performance of proposed framework in terms of provenance capturing overhead for different query sets including select, aggregate, and data update queries, and average execution time for various provenance queries. Springer International Publishing 2021-11-09 2022 /pmc/articles/PMC8576806/ /pubmed/34778513 http://dx.doi.org/10.1007/s41060-021-00287-9 Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Regular Paper
Rani, Asma
Goyal, Navneet
Gadia, Shashi K.
Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title_full Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title_fullStr Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title_full_unstemmed Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title_short Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
title_sort big social data provenance framework for zero-information loss key-value pair (kvp) database
topic Regular Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8576806/
https://www.ncbi.nlm.nih.gov/pubmed/34778513
http://dx.doi.org/10.1007/s41060-021-00287-9
work_keys_str_mv AT raniasma bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase
AT goyalnavneet bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase
AT gadiashashik bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase