Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP)

Bibliographic Details
Main authors: Haw, Su-Cheng; Amin, Aisyah; Wong, Chee-Onn; Subramaniam, Samini
Format: Online Article Text
Language: English
Published: F1000 Research Limited 2021
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772529/
https://www.ncbi.nlm.nih.gov/pubmed/35106138
http://dx.doi.org/10.12688/f1000research.69108.1
author Haw, Su-Cheng
Amin, Aisyah
Wong, Chee-Onn
Subramaniam, Samini
collection PubMed
description Background: As XML is the standard for exchanging data over the World Wide Web, it is important that an eXtensible Markup Language (XML) database supports not only efficient query processing but also frequent data update operations as Web content changes. Most existing XML annotation approaches rely on a labeling scheme to identify the hierarchical position of each XML node. This computation is costly, as any update will cause the whole XML tree to be re-labeled. This impact is observable on large datasets. Therefore, a robust labeling scheme that avoids re-labeling is crucial. Method: Here, we present ORD-GAP (named after Order Gap), a robust and persistent XML labeling scheme that supports dynamic updates. ORD-GAP assigns unique identifiers with gaps between XML nodes, from which the level and the Parent-Child (P-C), Ancestor-Descendant (A-D) and sibling relationships can easily be identified. ORD-GAP adopts the OrdPath labeling scheme for any future insertion. Results: We demonstrate that ORD-GAP is robust enough for dynamic updates and have implemented it in three use cases: (i) left-most, (ii) in-between and (iii) right-most insertion. Experimental evaluations on the DBLP dataset demonstrated that ORD-GAP outperformed existing approaches such as OrdPath and ME Labeling in terms of database storage size, data loading time and query retrieval. On average, ORD-GAP has the best storing and query retrieval time. Conclusion: The main contributions of this paper are: (i) a robust labeling scheme named ORD-GAP that assigns a gap between nodes to support future insertion, and (ii) an efficient mapping scheme, built upon the ORD-GAP labeling scheme, that transforms XML into a relational database (RDB) effectively.
format Online
Article
Text
id pubmed-8772529
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-8772529 2022-01-31 Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) Haw, Su-Cheng Amin, Aisyah Wong, Chee-Onn Subramaniam, Samini F1000Res Research Article
F1000 Research Limited 2021-09-09 /pmc/articles/PMC8772529/ /pubmed/35106138 http://dx.doi.org/10.12688/f1000research.69108.1 Text en Copyright: © 2021 Haw SC et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP)
topic Research Article