Cargando…

Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments

As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce th...

Descripción completa

Detalles Bibliográficos
Autores principales: Sabaghian, Khatereh, Khamforoosh, Keyhan, Ghaderzadeh, Abdolbaghi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270174/
https://www.ncbi.nlm.nih.gov/pubmed/34242314
http://dx.doi.org/10.1371/journal.pone.0254210
_version_ 1783720747002232832
author Sabaghian, Khatereh
Khamforoosh, Keyhan
Ghaderzadeh, Abdolbaghi
author_facet Sabaghian, Khatereh
Khamforoosh, Keyhan
Ghaderzadeh, Abdolbaghi
author_sort Sabaghian, Khatereh
collection PubMed
description As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce the number of replications in big data transfer and, eventually to lower the traffic load over the grid by classifying data efficiently and optimally based on the sent data types and using VIKOR as a method of multivariate decision-making for ranking replication sites. Considering different variables, the VIKOR method makes it possible to take all the parameters effective in the assessment of site ranks into account. According to the results and evaluations, the proposed method has exhibited an improvement by about thirty percent in average over the LRU, LFU, BHR, and Without Rep. algorithms. Furthermore, it has improved the existing multivariate methods through different approaches to replication by thirty percent, as it considers effective parameters such as time, the number of replications, and replication site, causing replication to occur when it can make an improvement in terms of access.
format Online
Article
Text
id pubmed-8270174
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-82701742021-07-21 Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments Sabaghian, Khatereh Khamforoosh, Keyhan Ghaderzadeh, Abdolbaghi PLoS One Research Article As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce the number of replications in big data transfer and, eventually to lower the traffic load over the grid by classifying data efficiently and optimally based on the sent data types and using VIKOR as a method of multivariate decision-making for ranking replication sites. Considering different variables, the VIKOR method makes it possible to take all the parameters effective in the assessment of site ranks into account. According to the results and evaluations, the proposed method has exhibited an improvement by about thirty percent in average over the LRU, LFU, BHR, and Without Rep. algorithms. Furthermore, it has improved the existing multivariate methods through different approaches to replication by thirty percent, as it considers effective parameters such as time, the number of replications, and replication site, causing replication to occur when it can make an improvement in terms of access. Public Library of Science 2021-07-09 /pmc/articles/PMC8270174/ /pubmed/34242314 http://dx.doi.org/10.1371/journal.pone.0254210 Text en © 2021 Sabaghian et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sabaghian, Khatereh
Khamforoosh, Keyhan
Ghaderzadeh, Abdolbaghi
Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title_full Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title_fullStr Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title_full_unstemmed Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title_short Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
title_sort presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270174/
https://www.ncbi.nlm.nih.gov/pubmed/34242314
http://dx.doi.org/10.1371/journal.pone.0254210
work_keys_str_mv AT sabaghiankhatereh presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments
AT khamforooshkeyhan presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments
AT ghaderzadehabdolbaghi presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments