Cargando…
Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database
Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data proven...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8576806/ https://www.ncbi.nlm.nih.gov/pubmed/34778513 http://dx.doi.org/10.1007/s41060-021-00287-9 |
_version_ | 1784595954304811008 |
---|---|
author | Rani, Asma Goyal, Navneet Gadia, Shashi K. |
author_facet | Rani, Asma Goyal, Navneet Gadia, Shashi K. |
author_sort | Rani, Asma |
collection | PubMed |
description | Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data provenance of digital content. Social Data Provenance describes the origin, derivation process, and transformations of social content throughout its lifecycle. In this paper, we present a Big Social Data Provenance (BSDP) Framework for key-value pair (KVP) database using the novel concept of Zero-Information Loss Database (ZILD). In our proposed framework, a huge volume of social data is first fetched from the social media (Twitter’s Network) through live streaming and simultaneously modelled in a KVP database by using a query-driven approach. The proposed framework is capable in capturing, storing, and querying provenance information for different query sets including select, aggregate, standing/historical, and data update (i.e., insert, delete, update) queries on Big Social Data. We evaluate the performance of proposed framework in terms of provenance capturing overhead for different query sets including select, aggregate, and data update queries, and average execution time for various provenance queries. |
format | Online Article Text |
id | pubmed-8576806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-85768062021-11-09 Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database Rani, Asma Goyal, Navneet Gadia, Shashi K. Int J Data Sci Anal Regular Paper Social media has been playing a vital importance in information sharing at massive scale due to its easy access, low cost, and faster dissemination of information. Its competence to disseminate the information across a wide audience has raised a critical challenge to determine the social data provenance of digital content. Social Data Provenance describes the origin, derivation process, and transformations of social content throughout its lifecycle. In this paper, we present a Big Social Data Provenance (BSDP) Framework for key-value pair (KVP) database using the novel concept of Zero-Information Loss Database (ZILD). In our proposed framework, a huge volume of social data is first fetched from the social media (Twitter’s Network) through live streaming and simultaneously modelled in a KVP database by using a query-driven approach. The proposed framework is capable in capturing, storing, and querying provenance information for different query sets including select, aggregate, standing/historical, and data update (i.e., insert, delete, update) queries on Big Social Data. We evaluate the performance of proposed framework in terms of provenance capturing overhead for different query sets including select, aggregate, and data update queries, and average execution time for various provenance queries. Springer International Publishing 2021-11-09 2022 /pmc/articles/PMC8576806/ /pubmed/34778513 http://dx.doi.org/10.1007/s41060-021-00287-9 Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Regular Paper Rani, Asma Goyal, Navneet Gadia, Shashi K. Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title | Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title_full | Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title_fullStr | Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title_full_unstemmed | Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title_short | Big social data provenance framework for Zero-Information Loss Key-Value Pair (KVP) Database |
title_sort | big social data provenance framework for zero-information loss key-value pair (kvp) database |
topic | Regular Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8576806/ https://www.ncbi.nlm.nih.gov/pubmed/34778513 http://dx.doi.org/10.1007/s41060-021-00287-9 |
work_keys_str_mv | AT raniasma bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase AT goyalnavneet bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase AT gadiashashik bigsocialdataprovenanceframeworkforzeroinformationlosskeyvaluepairkvpdatabase |