Cargando…
A survey on platforms for big data analytics
The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on vari...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4505391/ https://www.ncbi.nlm.nih.gov/pubmed/26191487 http://dx.doi.org/10.1186/s40537-014-0008-6 |
_version_ | 1782381576730968064 |
---|---|
author | Singh, Dilpreet Reddy, Chandan K |
author_facet | Singh, Dilpreet Reddy, Chandan K |
author_sort | Singh, Dilpreet |
collection | PubMed |
description | The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on various metrics such as scalability, data I/O rate, fault tolerance, real-time processing, data size supported and iterative task support. In addition to the hardware, a detailed description of the software frameworks used within each of these platforms is also discussed along with their strengths and drawbacks. Some of the critical characteristics described here can potentially aid the readers in making an informed decision about the right choice of platforms depending on their computational needs. Using a star ratings table, a rigorous qualitative comparison between different platforms is also discussed for each of the six characteristics that are critical for the algorithms of big data analytics. In order to provide more insights into the effectiveness of each of the platform in the context of big data analytics, specific implementation level details of the widely used k-means clustering algorithm on various platforms are also described in the form pseudocode. |
format | Online Article Text |
id | pubmed-4505391 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-45053912015-07-17 A survey on platforms for big data analytics Singh, Dilpreet Reddy, Chandan K J Big Data Survey Paper The primary purpose of this paper is to provide an in-depth analysis of different platforms available for performing big data analytics. This paper surveys different hardware platforms available for big data analytics and assesses the advantages and drawbacks of each of these platforms based on various metrics such as scalability, data I/O rate, fault tolerance, real-time processing, data size supported and iterative task support. In addition to the hardware, a detailed description of the software frameworks used within each of these platforms is also discussed along with their strengths and drawbacks. Some of the critical characteristics described here can potentially aid the readers in making an informed decision about the right choice of platforms depending on their computational needs. Using a star ratings table, a rigorous qualitative comparison between different platforms is also discussed for each of the six characteristics that are critical for the algorithms of big data analytics. In order to provide more insights into the effectiveness of each of the platform in the context of big data analytics, specific implementation level details of the widely used k-means clustering algorithm on various platforms are also described in the form pseudocode. Springer International Publishing 2014-10-09 2015 /pmc/articles/PMC4505391/ /pubmed/26191487 http://dx.doi.org/10.1186/s40537-014-0008-6 Text en © Singh and Reddy; licensee Springer 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Survey Paper Singh, Dilpreet Reddy, Chandan K A survey on platforms for big data analytics |
title | A survey on platforms for big data analytics |
title_full | A survey on platforms for big data analytics |
title_fullStr | A survey on platforms for big data analytics |
title_full_unstemmed | A survey on platforms for big data analytics |
title_short | A survey on platforms for big data analytics |
title_sort | survey on platforms for big data analytics |
topic | Survey Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4505391/ https://www.ncbi.nlm.nih.gov/pubmed/26191487 http://dx.doi.org/10.1186/s40537-014-0008-6 |
work_keys_str_mv | AT singhdilpreet asurveyonplatformsforbigdataanalytics AT reddychandank asurveyonplatformsforbigdataanalytics AT singhdilpreet surveyonplatformsforbigdataanalytics AT reddychandank surveyonplatformsforbigdataanalytics |