Cargando…

Evaluation of Erasure Coding & other features of Hadoop 3

<!--HTML-->Apache Hadoop is a set of 2 domains: data computation such as Spark, MapReduce, Flink, etc and data storage - HDFS. HDFS is a distributed file system. Current HDFS provides 3x replication for data redundancy and availability. But it has 200% storage overhead. However there is a big...

Descripción completa

Detalles Bibliográficos
Autor principal: Seidan, Nazerke
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:http://cds.cern.ch/record/2687101
_version_ 1780963578802601984
author Seidan, Nazerke
author_facet Seidan, Nazerke
author_sort Seidan, Nazerke
collection CERN
description <!--HTML-->Apache Hadoop is a set of 2 domains: data computation such as Spark, MapReduce, Flink, etc and data storage - HDFS. HDFS is a distributed file system. Current HDFS provides 3x replication for data redundancy and availability. But it has 200% storage overhead. However there is a big improvement in Hadoop 3 for replication which is Erasure Coding (EC). Erasure Coding gives the same level of fault tolerance as 3x replication but with much less storage space. My project aims to evaluate the performance of Erasure Coding.
id cern-2687101
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2019
record_format invenio
spelling cern-26871012022-11-03T21:19:41Zhttp://cds.cern.ch/record/2687101engSeidan, NazerkeEvaluation of Erasure Coding & other features of Hadoop 3Second CERN openlab summer student lightning talk sessionCERN openlab Summer Student programme 2019<!--HTML-->Apache Hadoop is a set of 2 domains: data computation such as Spark, MapReduce, Flink, etc and data storage - HDFS. HDFS is a distributed file system. Current HDFS provides 3x replication for data redundancy and availability. But it has 200% storage overhead. However there is a big improvement in Hadoop 3 for replication which is Erasure Coding (EC). Erasure Coding gives the same level of fault tolerance as 3x replication but with much less storage space. My project aims to evaluate the performance of Erasure Coding.oai:cds.cern.ch:26871012019
spellingShingle CERN openlab Summer Student programme 2019
Seidan, Nazerke
Evaluation of Erasure Coding & other features of Hadoop 3
title Evaluation of Erasure Coding & other features of Hadoop 3
title_full Evaluation of Erasure Coding & other features of Hadoop 3
title_fullStr Evaluation of Erasure Coding & other features of Hadoop 3
title_full_unstemmed Evaluation of Erasure Coding & other features of Hadoop 3
title_short Evaluation of Erasure Coding & other features of Hadoop 3
title_sort evaluation of erasure coding & other features of hadoop 3
topic CERN openlab Summer Student programme 2019
url http://cds.cern.ch/record/2687101
work_keys_str_mv AT seidannazerke evaluationoferasurecodingotherfeaturesofhadoop3
AT seidannazerke secondcernopenlabsummerstudentlightningtalksession