Cargando…

Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies

MapReduce is the preferred cloud computing framework used in large data analysis and application processing. MapReduce frameworks currently in place suffer performance degradation due to the adoption of sequential processing approaches with little modification and thus exhibit underutilization of cl...

Descripción completa

Detalles Bibliográficos
Autores principales:	Al-Absi, Ahmed Abdulhakim, Al-Sammarraie, Najeeb Abbas, Shaher Yafooz, Wael Mohamed, Kang, Dae-Ki
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6207866/ https://www.ncbi.nlm.nih.gov/pubmed/30417014 http://dx.doi.org/10.1155/2018/7501042

_version_	1783366603788779520
author	Al-Absi, Ahmed Abdulhakim Al-Sammarraie, Najeeb Abbas Shaher Yafooz, Wael Mohamed Kang, Dae-Ki
author_facet	Al-Absi, Ahmed Abdulhakim Al-Sammarraie, Najeeb Abbas Shaher Yafooz, Wael Mohamed Kang, Dae-Ki
author_sort	Al-Absi, Ahmed Abdulhakim
collection	PubMed
description	MapReduce is the preferred cloud computing framework used in large data analysis and application processing. MapReduce frameworks currently in place suffer performance degradation due to the adoption of sequential processing approaches with little modification and thus exhibit underutilization of cloud resources. To overcome this drawback and reduce costs, we introduce a Parallel MapReduce (PMR) framework in this paper. We design a novel parallel execution strategy of Map and Reduce worker nodes. Our strategy enables further performance improvement and efficient utilization of cloud resources execution of Map and Reduce functions to utilize multicore environments available with computing nodes. We explain in detail makespan modeling and working principle of the PMR framework in the paper. Performance of PMR is compared with Hadoop through experiments considering three biomedical applications. Experiments conducted for BLAST, CAP3, and DeepBind biomedical applications report makespan time reduction of 38.92%, 18.00%, and 34.62% considering the PMR framework against Hadoop framework. Experiments' results prove that the PMR cloud computing platform proposed is robust, cost-effective, and scalable, which sufficiently supports diverse applications on public and private cloud platforms. Consequently, overall presentation and results indicate that there is good matching between theoretical makespan modeling presented and experimental values investigated.
format	Online Article Text
id	pubmed-6207866
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-62078662018-11-11 Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies Al-Absi, Ahmed Abdulhakim Al-Sammarraie, Najeeb Abbas Shaher Yafooz, Wael Mohamed Kang, Dae-Ki Biomed Res Int Research Article MapReduce is the preferred cloud computing framework used in large data analysis and application processing. MapReduce frameworks currently in place suffer performance degradation due to the adoption of sequential processing approaches with little modification and thus exhibit underutilization of cloud resources. To overcome this drawback and reduce costs, we introduce a Parallel MapReduce (PMR) framework in this paper. We design a novel parallel execution strategy of Map and Reduce worker nodes. Our strategy enables further performance improvement and efficient utilization of cloud resources execution of Map and Reduce functions to utilize multicore environments available with computing nodes. We explain in detail makespan modeling and working principle of the PMR framework in the paper. Performance of PMR is compared with Hadoop through experiments considering three biomedical applications. Experiments conducted for BLAST, CAP3, and DeepBind biomedical applications report makespan time reduction of 38.92%, 18.00%, and 34.62% considering the PMR framework against Hadoop framework. Experiments' results prove that the PMR cloud computing platform proposed is robust, cost-effective, and scalable, which sufficiently supports diverse applications on public and private cloud platforms. Consequently, overall presentation and results indicate that there is good matching between theoretical makespan modeling presented and experimental values investigated. Hindawi 2018-10-17 /pmc/articles/PMC6207866/ /pubmed/30417014 http://dx.doi.org/10.1155/2018/7501042 Text en Copyright © 2018 Ahmed Abdulhakim Al-Absi et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Al-Absi, Ahmed Abdulhakim Al-Sammarraie, Najeeb Abbas Shaher Yafooz, Wael Mohamed Kang, Dae-Ki Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title	Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title_full	Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title_fullStr	Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title_full_unstemmed	Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title_short	Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies
title_sort	parallel mapreduce: maximizing cloud resource utilization and performance improvement using parallel execution strategies
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6207866/ https://www.ncbi.nlm.nih.gov/pubmed/30417014 http://dx.doi.org/10.1155/2018/7501042
work_keys_str_mv	AT alabsiahmedabdulhakim parallelmapreducemaximizingcloudresourceutilizationandperformanceimprovementusingparallelexecutionstrategies AT alsammarraienajeebabbas parallelmapreducemaximizingcloudresourceutilizationandperformanceimprovementusingparallelexecutionstrategies AT shaheryafoozwaelmohamed parallelmapreducemaximizingcloudresourceutilizationandperformanceimprovementusingparallelexecutionstrategies AT kangdaeki parallelmapreducemaximizingcloudresourceutilizationandperformanceimprovementusingparallelexecutionstrategies

Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies

Ejemplares similares