Introducing Object Storage in Hadoop Ecosystem
CERN handles one of the highest data rates every day. The current Hadoop ecosystem uses the Hadoop Distributed File System (HDFS) for storage. While HDFS causes no problems at present, it has some inefficiencies. First, it carries a storage capacity overhead cost. Second, HDFS is no longer a to...
Main author: | Pareja Prieto, Laura |
---|---|
Language: | eng |
Published: | 2022 |
Subjects: | Physics in General |
Online access: | http://cds.cern.ch/record/2835585 |
_version_ | 1780975647876710400 |
---|---|
author | Pareja Prieto, Laura |
author_facet | Pareja Prieto, Laura |
author_sort | Pareja Prieto, Laura |
collection | CERN |
description | CERN handles one of the highest data rates every day. The current Hadoop ecosystem uses the Hadoop Distributed File System (HDFS) for storage. While HDFS causes no problems at present, it has some inefficiencies. First, it carries a storage capacity overhead cost. Second, HDFS is no longer a top Big Data standard in the industry: the community is moving towards solutions based on object storage, which potentially offer a more cost-effective solution. This project aims to evaluate Apache Ozone as a possible object storage solution for the current Hadoop architecture. It looks into the implementation of Ozone and how it connects and operates with current services of the Hadoop architecture such as YARN and Apache Spark. |
id | cern-2835585 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2022 |
record_format | invenio |
spelling | cern-2835585; 2022-10-05T21:41:39Z; http://cds.cern.ch/record/2835585; eng; Pareja Prieto, Laura; Introducing Object Storage in Hadoop Ecosystem; Physics in General; CERN handles one of the highest data rates every day. The current Hadoop ecosystem uses the Hadoop Distributed File System (HDFS) for storage. While HDFS causes no problems at present, it has some inefficiencies. First, it carries a storage capacity overhead cost. Second, HDFS is no longer a top Big Data standard in the industry: the community is moving towards solutions based on object storage, which potentially offer a more cost-effective solution. This project aims to evaluate Apache Ozone as a possible object storage solution for the current Hadoop architecture. It looks into the implementation of Ozone and how it connects and operates with current services of the Hadoop architecture such as YARN and Apache Spark.; CERN-STUDENTS-Note-2022-191; oai:cds.cern.ch:2835585; 2022-10-05 |
spellingShingle | Physics in General Pareja Prieto, Laura Introducing Object Storage in Hadoop Ecosystem |
title | Introducing Object Storage in Hadoop Ecosystem |
title_full | Introducing Object Storage in Hadoop Ecosystem |
title_fullStr | Introducing Object Storage in Hadoop Ecosystem |
title_full_unstemmed | Introducing Object Storage in Hadoop Ecosystem |
title_short | Introducing Object Storage in Hadoop Ecosystem |
title_sort | introducing object storage in hadoop ecosystem |
topic | Physics in General |
url | http://cds.cern.ch/record/2835585 |
work_keys_str_mv | AT parejaprietolaura introducingobjectstorageinhadoopecosystem |