Cargando…

Introducing Object Storage in Hadoop Ecosystem

CERN deals with one of the highest rates of data every day. The current Hadoop Ecosystem uses Hadoop Data File System (HDFS) as a storage option. While HDFS does not generate any problems at the moment, it has some inefficiencies. First, storage capacity overhead cost. Second, HDFS is no longer a to...

Descripción completa

Detalles Bibliográficos
Autor principal: Pareja Prieto, Laura
Lenguaje:eng
Publicado: 2022
Materias:
Acceso en línea:http://cds.cern.ch/record/2835585
_version_ 1780975647876710400
author Pareja Prieto, Laura
author_facet Pareja Prieto, Laura
author_sort Pareja Prieto, Laura
collection CERN
description CERN deals with one of the highest rates of data every day. The current Hadoop Ecosystem uses Hadoop Data File System (HDFS) as a storage option. While HDFS does not generate any problems at the moment, it has some inefficiencies. First, storage capacity overhead cost. Second, HDFS is no longer a top Big Data standard in the industry. The community is moving towards solutions based on object storage. Object storage potentially offers a better cost-effective solution. This project aims to evaluate Apache Ozone as a possible object storage solution for the current Hadoop architecture. It would look into the implementation of Ozone and how it connects and operates with some current services of the Hadoop architecture like Yarn and Apache Spark.
id cern-2835585
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2022
record_format invenio
spelling cern-28355852022-10-05T21:41:39Zhttp://cds.cern.ch/record/2835585engPareja Prieto, LauraIntroducing Object Storage in Hadoop EcosystemPhysics in GeneralCERN deals with one of the highest rates of data every day. The current Hadoop Ecosystem uses Hadoop Data File System (HDFS) as a storage option. While HDFS does not generate any problems at the moment, it has some inefficiencies. First, storage capacity overhead cost. Second, HDFS is no longer a top Big Data standard in the industry. The community is moving towards solutions based on object storage. Object storage potentially offers a better cost-effective solution. This project aims to evaluate Apache Ozone as a possible object storage solution for the current Hadoop architecture. It would look into the implementation of Ozone and how it connects and operates with some current services of the Hadoop architecture like Yarn and Apache Spark.CERN-STUDENTS-Note-2022-191oai:cds.cern.ch:28355852022-10-05
spellingShingle Physics in General
Pareja Prieto, Laura
Introducing Object Storage in Hadoop Ecosystem
title Introducing Object Storage in Hadoop Ecosystem
title_full Introducing Object Storage in Hadoop Ecosystem
title_fullStr Introducing Object Storage in Hadoop Ecosystem
title_full_unstemmed Introducing Object Storage in Hadoop Ecosystem
title_short Introducing Object Storage in Hadoop Ecosystem
title_sort introducing object storage in hadoop ecosystem
topic Physics in General
url http://cds.cern.ch/record/2835585
work_keys_str_mv AT parejaprietolaura introducingobjectstorageinhadoopecosystem