Cargando…
Developing and optimizing applications in Hadoop
This contribution is about sharing our recent experiences of building Hadoop based application. Hadoop ecosystem now offers myriad of tools which can overwhelm new users, yet there are successful ways these tools can be leveraged to solve problems. We look at factors to consider when using Hadoop to...
Autores principales: | , , |
---|---|
Lenguaje: | eng |
Publicado: |
2017
|
Materias: | |
Acceso en línea: | https://dx.doi.org/10.1088/1742-6596/898/7/072038 http://cds.cern.ch/record/2296800 |
_version_ | 1780956906439835648 |
---|---|
author | Kothuri, P Garcia, D Hermans, J |
author_facet | Kothuri, P Garcia, D Hermans, J |
author_sort | Kothuri, P |
collection | CERN |
description | This contribution is about sharing our recent experiences of building Hadoop based application. Hadoop ecosystem now offers myriad of tools which can overwhelm new users, yet there are successful ways these tools can be leveraged to solve problems. We look at factors to consider when using Hadoop to model and store data, best practices for moving data in and out of the system and common processing patterns, at each stage relating with the real world experience gained while developing such application. We share many of the design choices, tools developed and how to profile a distributed application which can be applied for other scenarios as well. In conclusion, the goal of the presentation is to provide guidance to architect Hadoop based application and share some of the reusable components developed in this process. |
id | oai-inspirehep.net-1638564 |
institution | Organización Europea para la Investigación Nuclear |
language | eng |
publishDate | 2017 |
record_format | invenio |
spelling | oai-inspirehep.net-16385642021-02-09T10:06:26Zdoi:10.1088/1742-6596/898/7/072038http://cds.cern.ch/record/2296800engKothuri, PGarcia, DHermans, JDeveloping and optimizing applications in HadoopComputing and ComputersThis contribution is about sharing our recent experiences of building Hadoop based application. Hadoop ecosystem now offers myriad of tools which can overwhelm new users, yet there are successful ways these tools can be leveraged to solve problems. We look at factors to consider when using Hadoop to model and store data, best practices for moving data in and out of the system and common processing patterns, at each stage relating with the real world experience gained while developing such application. We share many of the design choices, tools developed and how to profile a distributed application which can be applied for other scenarios as well. In conclusion, the goal of the presentation is to provide guidance to architect Hadoop based application and share some of the reusable components developed in this process.oai:inspirehep.net:16385642017 |
spellingShingle | Computing and Computers Kothuri, P Garcia, D Hermans, J Developing and optimizing applications in Hadoop |
title | Developing and optimizing applications in Hadoop |
title_full | Developing and optimizing applications in Hadoop |
title_fullStr | Developing and optimizing applications in Hadoop |
title_full_unstemmed | Developing and optimizing applications in Hadoop |
title_short | Developing and optimizing applications in Hadoop |
title_sort | developing and optimizing applications in hadoop |
topic | Computing and Computers |
url | https://dx.doi.org/10.1088/1742-6596/898/7/072038 http://cds.cern.ch/record/2296800 |
work_keys_str_mv | AT kothurip developingandoptimizingapplicationsinhadoop AT garciad developingandoptimizingapplicationsinhadoop AT hermansj developingandoptimizingapplicationsinhadoop |