Cargando…

Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach

The design and management of smart cities and the IoT is a multidimensional problem. One of those dimensions is cloud and edge computing management. Due to the complexity of the problem, resource sharing is one of the vital and major components that when enhanced, the performance of the whole system...

Descripción completa

Detalles Bibliográficos
Autores principales: Mostafa, Nour, Kotb, Yehia, Al-Arnaout, Zakwan, Alabed, Samer, Shdefat, Ahmed Younes
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221342/
https://www.ncbi.nlm.nih.gov/pubmed/37430552
http://dx.doi.org/10.3390/s23104639
_version_ 1785049433204850688
author Mostafa, Nour
Kotb, Yehia
Al-Arnaout, Zakwan
Alabed, Samer
Shdefat, Ahmed Younes
author_facet Mostafa, Nour
Kotb, Yehia
Al-Arnaout, Zakwan
Alabed, Samer
Shdefat, Ahmed Younes
author_sort Mostafa, Nour
collection PubMed
description The design and management of smart cities and the IoT is a multidimensional problem. One of those dimensions is cloud and edge computing management. Due to the complexity of the problem, resource sharing is one of the vital and major components that when enhanced, the performance of the whole system is enhanced. Research in data access and storage in multi-clouds and edge servers can broadly be classified to data centers and computational centers. The main aim of data centers is to provide services for accessing, sharing and modifying large databases. On the other hand, the aim of computational centers is to provide services for sharing resources. Present and future distributed applications need to deal with very large multi-petabyte datasets and increasing numbers of associated users and resources. The emergence of IoT-based, multi-cloud systems as a potential solution for large computational and data management problems has initiated significant research activity in the area. Due to the considerable increase in data production and data sharing within scientific communities, the need for improvements in data access and data availability cannot be overlooked. It can be argued that the current approaches of large dataset management do not solve all problems associated with big data and large datasets. The heterogeneity and veracity of big data require careful management. One of the issues for managing big data in a multi-cloud system is the scalability and expendability of the system under consideration. Data replication ensures server load balancing, data availability and improved data access time. The proposed model minimises the cost of data services through minimising a cost function that takes storage cost, host access cost and communication cost into consideration. The relative weights between different components is learned through history and it is different from a cloud to another. The model ensures that data are replicated in a way that increases availability while at the same time decreasing the overall cost of data storage and access time. Using the proposed model avoids the overheads of the traditional full replication techniques. The proposed model is mathematically proven to be sound and valid.
format Online
Article
Text
id pubmed-10221342
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102213422023-05-28 Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach Mostafa, Nour Kotb, Yehia Al-Arnaout, Zakwan Alabed, Samer Shdefat, Ahmed Younes Sensors (Basel) Article The design and management of smart cities and the IoT is a multidimensional problem. One of those dimensions is cloud and edge computing management. Due to the complexity of the problem, resource sharing is one of the vital and major components that when enhanced, the performance of the whole system is enhanced. Research in data access and storage in multi-clouds and edge servers can broadly be classified to data centers and computational centers. The main aim of data centers is to provide services for accessing, sharing and modifying large databases. On the other hand, the aim of computational centers is to provide services for sharing resources. Present and future distributed applications need to deal with very large multi-petabyte datasets and increasing numbers of associated users and resources. The emergence of IoT-based, multi-cloud systems as a potential solution for large computational and data management problems has initiated significant research activity in the area. Due to the considerable increase in data production and data sharing within scientific communities, the need for improvements in data access and data availability cannot be overlooked. It can be argued that the current approaches of large dataset management do not solve all problems associated with big data and large datasets. The heterogeneity and veracity of big data require careful management. One of the issues for managing big data in a multi-cloud system is the scalability and expendability of the system under consideration. Data replication ensures server load balancing, data availability and improved data access time. The proposed model minimises the cost of data services through minimising a cost function that takes storage cost, host access cost and communication cost into consideration. The relative weights between different components is learned through history and it is different from a cloud to another. The model ensures that data are replicated in a way that increases availability while at the same time decreasing the overall cost of data storage and access time. Using the proposed model avoids the overheads of the traditional full replication techniques. The proposed model is mathematically proven to be sound and valid. MDPI 2023-05-10 /pmc/articles/PMC10221342/ /pubmed/37430552 http://dx.doi.org/10.3390/s23104639 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mostafa, Nour
Kotb, Yehia
Al-Arnaout, Zakwan
Alabed, Samer
Shdefat, Ahmed Younes
Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title_full Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title_fullStr Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title_full_unstemmed Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title_short Replicating File Segments between Multi-Cloud Nodes in a Smart City: A Machine Learning Approach
title_sort replicating file segments between multi-cloud nodes in a smart city: a machine learning approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10221342/
https://www.ncbi.nlm.nih.gov/pubmed/37430552
http://dx.doi.org/10.3390/s23104639
work_keys_str_mv AT mostafanour replicatingfilesegmentsbetweenmulticloudnodesinasmartcityamachinelearningapproach
AT kotbyehia replicatingfilesegmentsbetweenmulticloudnodesinasmartcityamachinelearningapproach
AT alarnaoutzakwan replicatingfilesegmentsbetweenmulticloudnodesinasmartcityamachinelearningapproach
AT alabedsamer replicatingfilesegmentsbetweenmulticloudnodesinasmartcityamachinelearningapproach
AT shdefatahmedyounes replicatingfilesegmentsbetweenmulticloudnodesinasmartcityamachinelearningapproach