Evaluation of Erasure Coding & other features of Hadoop 3
Apache Hadoop spans two domains: data computation (Spark, MapReduce, Flink, etc.) and data storage (HDFS). HDFS is a distributed file system that currently provides 3x replication for data redundancy and availability, at the cost of 200% storage overhead. However, there is a big...
Main author: | Seidan, Nazerke |
---|---|
Language: | eng |
Published: | 2019 |
Subjects: | CERN openlab Summer Student programme 2019 |
Online access: | http://cds.cern.ch/record/2687101 |
_version_ | 1780963578802601984 |
---|---|
author | Seidan, Nazerke |
author_facet | Seidan, Nazerke |
author_sort | Seidan, Nazerke |
collection | CERN |
description | Apache Hadoop spans two domains: data computation (Spark, MapReduce, Flink, etc.) and data storage (HDFS). HDFS is a distributed file system that currently provides 3x replication for data redundancy and availability, at the cost of 200% storage overhead. Hadoop 3 introduces a major alternative to replication: Erasure Coding (EC).
Erasure Coding gives the same level of fault tolerance as 3x replication while using much less storage space.
My project aims to evaluate the performance of Erasure Coding. |
id | cern-2687101 |
institution | European Organization for Nuclear Research |
language | eng |
publishDate | 2019 |
record_format | invenio |
spelling | cern-2687101 2022-11-03T21:19:41Z http://cds.cern.ch/record/2687101 eng Seidan, Nazerke Evaluation of Erasure Coding & other features of Hadoop 3 Second CERN openlab summer student lightning talk session CERN openlab Summer Student programme 2019 Apache Hadoop is a set of 2 domains: data computation such as Spark, MapReduce, Flink, etc and data storage - HDFS. HDFS is a distributed file system. Current HDFS provides 3x replication for data redundancy and availability. But it has 200% storage overhead. However there is a big improvement in Hadoop 3 for replication which is Erasure Coding (EC). Erasure Coding gives the same level of fault tolerance as 3x replication but with much less storage space. My project aims to evaluate the performance of Erasure Coding. oai:cds.cern.ch:2687101 2019 |
spellingShingle | CERN openlab Summer Student programme 2019 Seidan, Nazerke Evaluation of Erasure Coding & other features of Hadoop 3 |
title | Evaluation of Erasure Coding & other features of Hadoop 3 |
title_full | Evaluation of Erasure Coding & other features of Hadoop 3 |
title_fullStr | Evaluation of Erasure Coding & other features of Hadoop 3 |
title_full_unstemmed | Evaluation of Erasure Coding & other features of Hadoop 3 |
title_short | Evaluation of Erasure Coding & other features of Hadoop 3 |
title_sort | evaluation of erasure coding & other features of hadoop 3 |
topic | CERN openlab Summer Student programme 2019 |
url | http://cds.cern.ch/record/2687101 |
work_keys_str_mv | AT seidannazerke evaluationoferasurecodingotherfeaturesofhadoop3 AT seidannazerke secondcernopenlabsummerstudentlightningtalksession |
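
The storage figures quoted in the description can be checked with a little arithmetic. The sketch below is illustrative only and is not taken from the talk. It assumes a Reed-Solomon layout with 6 data units and 3 parity units, which matches the built-in RS-6-3 policy family in Hadoop 3; the record itself does not say which policy was evaluated. The `Scheme` class and its field names are hypothetical helpers introduced for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Scheme:
    """A redundancy scheme described by its data and redundant units.

    Hypothetical helper for illustration; not part of any Hadoop API.
    """
    name: str
    data_units: int       # units holding original data
    redundant_units: int  # extra copies (replication) or parity units (EC)

    @property
    def overhead_percent(self) -> float:
        # Extra storage relative to the original data size.
        return 100.0 * self.redundant_units / self.data_units

    @property
    def tolerated_failures(self) -> int:
        # Number of units that can be lost without losing data.
        return self.redundant_units

    def raw_bytes(self, logical_bytes: int) -> float:
        # Total bytes stored on disk for a given amount of user data.
        total_units = self.data_units + self.redundant_units
        return logical_bytes * total_units / self.data_units


# 3x replication: one original block plus two extra copies.
REPLICATION_3X = Scheme("3x replication", data_units=1, redundant_units=2)

# Assumed EC layout: Reed-Solomon with 6 data and 3 parity units.
RS_6_3 = Scheme("RS(6,3) erasure coding", data_units=6, redundant_units=3)

for scheme in (REPLICATION_3X, RS_6_3):
    print(
        f"{scheme.name}: overhead {scheme.overhead_percent:.0f}%, "
        f"tolerates {scheme.tolerated_failures} lost units, "
        f"600 MB of data occupies {scheme.raw_bytes(600):.0f} MB"
    )
```

Under these assumptions, 3x replication yields the 200% overhead stated in the description, while the assumed RS(6,3) layout needs 50% extra space and can lose up to three units per stripe.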