Developing and optimizing applications in Hadoop

This contribution shares our recent experience of building Hadoop-based applications. The Hadoop ecosystem now offers a myriad of tools that can overwhelm new users, yet there are successful ways to leverage these tools to solve problems. We look at factors to consider when using Hadoop to...

Bibliographic Details
Main authors: Kothuri, P, Garcia, D, Hermans, J
Language: eng
Published: 2017
Subjects: Computing and Computers
Online access: https://dx.doi.org/10.1088/1742-6596/898/7/072038
http://cds.cern.ch/record/2296800
_version_ 1780956906439835648
author Kothuri, P
Garcia, D
Hermans, J
author_facet Kothuri, P
Garcia, D
Hermans, J
author_sort Kothuri, P
collection CERN
description This contribution shares our recent experience of building Hadoop-based applications. The Hadoop ecosystem now offers a myriad of tools that can overwhelm new users, yet there are successful ways to leverage these tools to solve problems. We look at factors to consider when using Hadoop to model and store data, best practices for moving data in and out of the system, and common processing patterns, relating each stage to the real-world experience gained while developing such applications. We share many of the design choices and tools developed, and show how to profile a distributed application; these can be applied to other scenarios as well. In conclusion, the goal of the presentation is to provide guidance on architecting Hadoop-based applications and to share some of the reusable components developed in this process.
id oai-inspirehep.net-1638564
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2017
record_format invenio
spelling oai-inspirehep.net-1638564 2021-02-09T10:06:26Z doi:10.1088/1742-6596/898/7/072038 http://cds.cern.ch/record/2296800 eng Kothuri, P Garcia, D Hermans, J Developing and optimizing applications in Hadoop Computing and Computers This contribution shares our recent experience of building Hadoop-based applications. The Hadoop ecosystem now offers a myriad of tools that can overwhelm new users, yet there are successful ways to leverage these tools to solve problems. We look at factors to consider when using Hadoop to model and store data, best practices for moving data in and out of the system, and common processing patterns, relating each stage to the real-world experience gained while developing such applications. We share many of the design choices and tools developed, and show how to profile a distributed application; these can be applied to other scenarios as well. In conclusion, the goal of the presentation is to provide guidance on architecting Hadoop-based applications and to share some of the reusable components developed in this process. oai:inspirehep.net:1638564 2017
spellingShingle Computing and Computers
Kothuri, P
Garcia, D
Hermans, J
Developing and optimizing applications in Hadoop
title Developing and optimizing applications in Hadoop
title_full Developing and optimizing applications in Hadoop
title_fullStr Developing and optimizing applications in Hadoop
title_full_unstemmed Developing and optimizing applications in Hadoop
title_short Developing and optimizing applications in Hadoop
title_sort developing and optimizing applications in hadoop
topic Computing and Computers
url https://dx.doi.org/10.1088/1742-6596/898/7/072038
http://cds.cern.ch/record/2296800
work_keys_str_mv AT kothurip developingandoptimizingapplicationsinhadoop
AT garciad developingandoptimizingapplicationsinhadoop
AT hermansj developingandoptimizingapplicationsinhadoop