Cargando…

A survey on platforms for big data analytics

The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on vari...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Dilpreet, Reddy, Chandan K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4505391/
https://www.ncbi.nlm.nih.gov/pubmed/26191487
http://dx.doi.org/10.1186/s40537-014-0008-6
_version_ 1782381576730968064
author Singh, Dilpreet
Reddy, Chandan K
author_facet Singh, Dilpreet
Reddy, Chandan K
author_sort Singh, Dilpreet
collection PubMed
description The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on various metrics such as scalability, data I/O rate, fault tolerance, real-time processing, data size supported and iterative task support. In addition to the hardware, a detailed description of the software frameworks used within each of these platforms is also discussed along with their strengths and drawbacks. Some of the critical characteristics described here can potentially aid the readers in making an informed decision about the right choice of platforms depending on their computational needs. Using a star ratings table, a rigorous qualitative comparison between different platforms is also discussed for each of the six characteristics that are critical for the algorithms of big data analytics. In order to provide more insights into the effectiveness of each of the platform in the context of big data analytics, specific implementation level details of the widely used k-means clustering algorithm on various platforms are also described in the form pseudocode.
format Online
Article
Text
id pubmed-4505391
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-45053912015-07-17 A survey on platforms for big data analytics Singh, Dilpreet Reddy, Chandan K J Big Data Survey Paper The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on various metrics such as scalability, data I/O rate, fault tolerance, real-time processing, data size supported and iterative task support. In addition to the hardware, a detailed description of the software frameworks used within each of these platforms is also discussed along with their strengths and drawbacks. Some of the critical characteristics described here can potentially aid the readers in making an informed decision about the right choice of platforms depending on their computational needs. Using a star ratings table, a rigorous qualitative comparison between different platforms is also discussed for each of the six characteristics that are critical for the algorithms of big data analytics. In order to provide more insights into the effectiveness of each of the platform in the context of big data analytics, specific implementation level details of the widely used k-means clustering algorithm on various platforms are also described in the form pseudocode. Springer International Publishing 2014-10-09 2015 /pmc/articles/PMC4505391/ /pubmed/26191487 http://dx.doi.org/10.1186/s40537-014-0008-6 Text en © Singh and Reddy; licensee Springer 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Survey Paper
Singh, Dilpreet
Reddy, Chandan K
A survey on platforms for big data analytics
title A survey on platforms for big data analytics
title_full A survey on platforms for big data analytics
title_fullStr A survey on platforms for big data analytics
title_full_unstemmed A survey on platforms for big data analytics
title_short A survey on platforms for big data analytics
title_sort survey on platforms for big data analytics
topic Survey Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4505391/
https://www.ncbi.nlm.nih.gov/pubmed/26191487
http://dx.doi.org/10.1186/s40537-014-0008-6
work_keys_str_mv AT singhdilpreet asurveyonplatformsforbigdataanalytics
AT reddychandank asurveyonplatformsforbigdataanalytics
AT singhdilpreet surveyonplatformsforbigdataanalytics
AT reddychandank surveyonplatformsforbigdataanalytics