Cargando…

Evaluation of Erasure Coding & other features of Hadoop 3

<!--HTML-->Apache Hadoop is a set of 2 domains: data computation such as Spark, MapReduce, Flink, etc and data storage - HDFS. HDFS is a distributed file system. Current HDFS provides 3x replication for data redundancy and availability. But it has 200% storage overhead. However there is a big...

Descripción completa

Detalles Bibliográficos
Autor principal: Seidan, Nazerke
Lenguaje:eng
Publicado: 2019
Materias:
Acceso en línea:http://cds.cern.ch/record/2687101