Cargando…
Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments
As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce th...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270174/ https://www.ncbi.nlm.nih.gov/pubmed/34242314 http://dx.doi.org/10.1371/journal.pone.0254210 |
_version_ | 1783720747002232832 |
---|---|
author | Sabaghian, Khatereh Khamforoosh, Keyhan Ghaderzadeh, Abdolbaghi |
author_facet | Sabaghian, Khatereh Khamforoosh, Keyhan Ghaderzadeh, Abdolbaghi |
author_sort | Sabaghian, Khatereh |
collection | PubMed |
description | As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce the number of replications in big data transfer and, eventually to lower the traffic load over the grid by classifying data efficiently and optimally based on the sent data types and using VIKOR as a method of multivariate decision-making for ranking replication sites. Considering different variables, the VIKOR method makes it possible to take all the parameters effective in the assessment of site ranks into account. According to the results and evaluations, the proposed method has exhibited an improvement by about thirty percent in average over the LRU, LFU, BHR, and Without Rep. algorithms. Furthermore, it has improved the existing multivariate methods through different approaches to replication by thirty percent, as it considers effective parameters such as time, the number of replications, and replication site, causing replication to occur when it can make an improvement in terms of access. |
format | Online Article Text |
id | pubmed-8270174 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-82701742021-07-21 Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments Sabaghian, Khatereh Khamforoosh, Keyhan Ghaderzadeh, Abdolbaghi PLoS One Research Article As the amounts of data and use of distributed systems for data storage and processing have increased, reducing the number of replications has turned into a crucial requirement in these systems, which has been addressed by plenty of research. In this paper, an algorithm has been proposed to reduce the number of replications in big data transfer and, eventually to lower the traffic load over the grid by classifying data efficiently and optimally based on the sent data types and using VIKOR as a method of multivariate decision-making for ranking replication sites. Considering different variables, the VIKOR method makes it possible to take all the parameters effective in the assessment of site ranks into account. According to the results and evaluations, the proposed method has exhibited an improvement by about thirty percent in average over the LRU, LFU, BHR, and Without Rep. algorithms. Furthermore, it has improved the existing multivariate methods through different approaches to replication by thirty percent, as it considers effective parameters such as time, the number of replications, and replication site, causing replication to occur when it can make an improvement in terms of access. Public Library of Science 2021-07-09 /pmc/articles/PMC8270174/ /pubmed/34242314 http://dx.doi.org/10.1371/journal.pone.0254210 Text en © 2021 Sabaghian et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Sabaghian, Khatereh Khamforoosh, Keyhan Ghaderzadeh, Abdolbaghi Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title | Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title_full | Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title_fullStr | Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title_full_unstemmed | Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title_short | Presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
title_sort | presentation of a new method based on modern multivariate approaches for big data replication in distributed environments |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8270174/ https://www.ncbi.nlm.nih.gov/pubmed/34242314 http://dx.doi.org/10.1371/journal.pone.0254210 |
work_keys_str_mv | AT sabaghiankhatereh presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments AT khamforooshkeyhan presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments AT ghaderzadehabdolbaghi presentationofanewmethodbasedonmodernmultivariateapproachesforbigdatareplicationindistributedenvironments |