Cargando…

An evaluation of relational and NoSQL distributed databases on a low-power cluster

The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices create challenges for cloud data systems in order to support huge datasets and very high request rates. NoSQL databases, such as Cassandra and HBase, and relational SQL dat...

Descripción completa

Detalles Bibliográficos
Autores principales: da Silva, Lucas Ferreira, Lima, João V. F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10035467/
https://www.ncbi.nlm.nih.gov/pubmed/37359332
http://dx.doi.org/10.1007/s11227-023-05166-7
_version_ 1784911419168260096
author da Silva, Lucas Ferreira
Lima, João V. F.
author_facet da Silva, Lucas Ferreira
Lima, João V. F.
author_sort da Silva, Lucas Ferreira
collection PubMed
description The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices create challenges for cloud data systems in order to support huge datasets and very high request rates. NoSQL databases, such as Cassandra and HBase, and relational SQL databases with replication, such as Citus/PostgreSQL, have been used to increase horizontal scalability and high availability of data store systems. In this paper, we evaluated three distributed databases on a low-power low-cost cluster of commodity Single-Board Computers (SBC): relational Citus/PostgreSQL and NoSQL databases Cassandra and HBase. The cluster has 15 Raspberry Pi 3 nodes with Docker Swarm orchestration tool for service deployment and ingress load balancing over SBCs. We believe that a low-cost SBC cluster can support cloud serving goals such as scale-out, elasticity, and high availability. Experimental results clearly demonstrated that there is a trade-off between performance and replication, which provides availability and partition tolerance. Besides, both properties are essential in the context of distributed systems with low-power boards. Cassandra attained better results with its consistency levels specified by the client. Both Citus and HBase enable consistency but it penalizes performance as the number of replicas increases.
format Online
Article
Text
id pubmed-10035467
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-100354672023-03-23 An evaluation of relational and NoSQL distributed databases on a low-power cluster da Silva, Lucas Ferreira Lima, João V. F. J Supercomput Article The constant growth of social media, unconventional web technologies, mobile applications, and Internet of Things (IoT) devices create challenges for cloud data systems in order to support huge datasets and very high request rates. NoSQL databases, such as Cassandra and HBase, and relational SQL databases with replication, such as Citus/PostgreSQL, have been used to increase horizontal scalability and high availability of data store systems. In this paper, we evaluated three distributed databases on a low-power low-cost cluster of commodity Single-Board Computers (SBC): relational Citus/PostgreSQL and NoSQL databases Cassandra and HBase. The cluster has 15 Raspberry Pi 3 nodes with Docker Swarm orchestration tool for service deployment and ingress load balancing over SBCs. We believe that a low-cost SBC cluster can support cloud serving goals such as scale-out, elasticity, and high availability. Experimental results clearly demonstrated that there is a trade-off between performance and replication, which provides availability and partition tolerance. Besides, both properties are essential in the context of distributed systems with low-power boards. Cassandra attained better results with its consistency levels specified by the client. Both Citus and HBase enable consistency but it penalizes performance as the number of replicas increases. Springer US 2023-03-23 /pmc/articles/PMC10035467/ /pubmed/37359332 http://dx.doi.org/10.1007/s11227-023-05166-7 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
da Silva, Lucas Ferreira
Lima, João V. F.
An evaluation of relational and NoSQL distributed databases on a low-power cluster
title An evaluation of relational and NoSQL distributed databases on a low-power cluster
title_full An evaluation of relational and NoSQL distributed databases on a low-power cluster
title_fullStr An evaluation of relational and NoSQL distributed databases on a low-power cluster
title_full_unstemmed An evaluation of relational and NoSQL distributed databases on a low-power cluster
title_short An evaluation of relational and NoSQL distributed databases on a low-power cluster
title_sort evaluation of relational and nosql distributed databases on a low-power cluster
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10035467/
https://www.ncbi.nlm.nih.gov/pubmed/37359332
http://dx.doi.org/10.1007/s11227-023-05166-7
work_keys_str_mv AT dasilvalucasferreira anevaluationofrelationalandnosqldistributeddatabasesonalowpowercluster
AT limajoaovf anevaluationofrelationalandnosqldistributeddatabasesonalowpowercluster
AT dasilvalucasferreira evaluationofrelationalandnosqldistributeddatabasesonalowpowercluster
AT limajoaovf evaluationofrelationalandnosqldistributeddatabasesonalowpowercluster