Cargando…
Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP)
Background : As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic chang...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772529/ https://www.ncbi.nlm.nih.gov/pubmed/35106138 http://dx.doi.org/10.12688/f1000research.69108.1 |
_version_ | 1784635867428552704 |
---|---|
author | Haw, Su-Cheng Amin, Aisyah Wong, Chee-Onn Subramaniam, Samini |
author_facet | Haw, Su-Cheng Amin, Aisyah Wong, Chee-Onn Subramaniam, Samini |
author_sort | Haw, Su-Cheng |
collection | PubMed |
description | Background : As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic changes of Web content. Most of the existing XML annotation is based on a labeling scheme to identify each hierarchical position of the XML nodes. This computation is costly as any updates will cause the whole XML tree to be re-labelled. This impact can be observed on large datasets. Therefore, a robust labeling scheme that avoids re-labeling is crucial. Method: Here, we present ORD-GAP (named after Order Gap), a robust and persistent XML labeling scheme that supports dynamic updates. ORD-GAP assigns unique identifiers with gaps in-between XML nodes, which could easily identify the level, Parent-Child (P-C), Ancestor-Descendant (A-D) and sibling relationship. ORD-GAP adopts the OrdPath labeling scheme for any future insertion. Results: We demonstrate that ORD-GAP is robust enough for dynamic updates, and have implemented it in three use cases: (i) left-most, (ii) in-between and (iii) right-most insertion. Experimental evaluations on DBLP dataset demonstrated that ORD-GAP outperformed existing approaches such as ORDPath and ME Labeling concerning database storage size, data loading time and query retrieval. On average, ORD-GAP has the best storing and query retrieval time. Conclusion: The main contributions of this paper are: (i) A robust labeling scheme named ORD-GAP that assigns certain gap between each node to support future insertion, and (ii) An efficient mapping scheme, which built upon ORD-GAP labeling scheme to transform XML into RDB effectively. |
format | Online Article Text |
id | pubmed-8772529 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-87725292022-01-31 Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) Haw, Su-Cheng Amin, Aisyah Wong, Chee-Onn Subramaniam, Samini F1000Res Research Article Background : As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic changes of Web content. Most of the existing XML annotation is based on a labeling scheme to identify each hierarchical position of the XML nodes. This computation is costly as any updates will cause the whole XML tree to be re-labelled. This impact can be observed on large datasets. Therefore, a robust labeling scheme that avoids re-labeling is crucial. Method: Here, we present ORD-GAP (named after Order Gap), a robust and persistent XML labeling scheme that supports dynamic updates. ORD-GAP assigns unique identifiers with gaps in-between XML nodes, which could easily identify the level, Parent-Child (P-C), Ancestor-Descendant (A-D) and sibling relationship. ORD-GAP adopts the OrdPath labeling scheme for any future insertion. Results: We demonstrate that ORD-GAP is robust enough for dynamic updates, and have implemented it in three use cases: (i) left-most, (ii) in-between and (iii) right-most insertion. Experimental evaluations on DBLP dataset demonstrated that ORD-GAP outperformed existing approaches such as ORDPath and ME Labeling concerning database storage size, data loading time and query retrieval. On average, ORD-GAP has the best storing and query retrieval time. Conclusion: The main contributions of this paper are: (i) A robust labeling scheme named ORD-GAP that assigns certain gap between each node to support future insertion, and (ii) An efficient mapping scheme, which built upon ORD-GAP labeling scheme to transform XML into RDB effectively. F1000 Research Limited 2021-09-09 /pmc/articles/PMC8772529/ /pubmed/35106138 http://dx.doi.org/10.12688/f1000research.69108.1 Text en Copyright: © 2021 Haw SC et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Haw, Su-Cheng Amin, Aisyah Wong, Chee-Onn Subramaniam, Samini Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title | Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title_full | Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title_fullStr | Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title_full_unstemmed | Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title_short | Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) |
title_sort | improving the support for xml dynamic updates using a hybridization labeling scheme (ord-gap) |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8772529/ https://www.ncbi.nlm.nih.gov/pubmed/35106138 http://dx.doi.org/10.12688/f1000research.69108.1 |
work_keys_str_mv | AT hawsucheng improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgap AT aminaisyah improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgap AT wongcheeonn improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgap AT subramaniamsamini improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgap |