Cargando…

Performance analysis and optimization of AMGA for the WISDOM environment

AMGA is a gLite-metadata catalogue service designed to offer access to metadata for files stored on the Grid. We evaluated AMGA to analyze whether it is suitable for the WISDOM environment, where thousands of jobs access it simultaneously to get metadata describing docking results and the status of...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahn, Sunil, Kim, Namgyu, Lee, Seehoon, Hwang, Soonwook, Breton, Vincent, Koblitz, Birger
Lenguaje:eng
Publicado: 2008
Materias:
Acceso en línea:http://cds.cern.ch/record/1123338
_version_ 1780914635912773632
author Ahn, Sunil
Kim, Namgyu
Lee, Seehoon
Hwang, Soonwook
Breton, Vincent
Koblitz, Birger
author_facet Ahn, Sunil
Kim, Namgyu
Lee, Seehoon
Hwang, Soonwook
Breton, Vincent
Koblitz, Birger
author_sort Ahn, Sunil
collection CERN
description AMGA is a gLite-metadata catalogue service designed to offer access to metadata for files stored on the Grid. We evaluated AMGA to analyze whether it is suitable for the WISDOM environment, where thousands of jobs access it simultaneously to get metadata describing docking results and the status of jobs. In this work, we address performance issues on AMGA and propose new techniques to improve AMGA performance in the WISDOM environment. In the WISDOM environment, thousands of job agents distributed on the Grid may have access to an AMGA server simultaneously (1) to take docking tasks out of the AMGA server to execute on the machine that they are sitting, (2) to get the related ligand and target information, and (3) to store the docking results. The docking tasks take about 10 to 30 minutes to finish depending on the machine that they run and the docking configuration. We have carried out some performance analysis on the current AMGA implementation. Due to the overhead required to handle GSI/SSL connection on the Grid, it showed about 350% poorer throughput compared with a direct DB access. In the current version of WISDOM, AMGA is used as a placeholder for a task distribution table where docking tasks are stored and maintained. We have found a serious performance degrade due to the overhead caused by the need to lock the whole table to prevent different agents from taking the same task. First, in order to address the SSL/GSI-related performance issue, we have proposed a load-balanced multiple server and a DB connection pool technique in AMGA, Our preliminary test results demonstrate a linear performance improvement in proportion to the number of AMGA servers. Secondly, to deal with the performance degrading problem associated with the locking of the whole table, we modified the AMGA source code and added a new API that allows the two separate AMGA APIs, SELECT and UPDATE needed to take a task, to be invoked at once. Our preliminary tests show that the new API allows about 50 tasks to be retrieved per second in contrast with one task per second being retrieved using the two separate SELECT and UPDATE API calls. We addressed performance issues on the use of AMGA in the WISDOM environment and presented some new techniques to drastically improve the performance of AMGA. The techniques are expected to be integrated in the new release of WISDOM environment, being deployed in the EGEE biomed VO infrastructure for the next WISDOM data challenge.
id cern-1123338
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2008
record_format invenio
spelling cern-11233382019-09-30T06:29:59Zhttp://cds.cern.ch/record/1123338engAhn, SunilKim, NamgyuLee, SeehoonHwang, SoonwookBreton, VincentKoblitz, BirgerPerformance analysis and optimization of AMGA for the WISDOM environmentComputing and ComputersAMGA is a gLite-metadata catalogue service designed to offer access to metadata for files stored on the Grid. We evaluated AMGA to analyze whether it is suitable for the WISDOM environment, where thousands of jobs access it simultaneously to get metadata describing docking results and the status of jobs. In this work, we address performance issues on AMGA and propose new techniques to improve AMGA performance in the WISDOM environment. In the WISDOM environment, thousands of job agents distributed on the Grid may have access to an AMGA server simultaneously (1) to take docking tasks out of the AMGA server to execute on the machine that they are sitting, (2) to get the related ligand and target information, and (3) to store the docking results. The docking tasks take about 10 to 30 minutes to finish depending on the machine that they run and the docking configuration. We have carried out some performance analysis on the current AMGA implementation. Due to the overhead required to handle GSI/SSL connection on the Grid, it showed about 350% poorer throughput compared with a direct DB access. In the current version of WISDOM, AMGA is used as a placeholder for a task distribution table where docking tasks are stored and maintained. We have found a serious performance degrade due to the overhead caused by the need to lock the whole table to prevent different agents from taking the same task. First, in order to address the SSL/GSI-related performance issue, we have proposed a load-balanced multiple server and a DB connection pool technique in AMGA, Our preliminary test results demonstrate a linear performance improvement in proportion to the number of AMGA servers. Secondly, to deal with the performance degrading problem associated with the locking of the whole table, we modified the AMGA source code and added a new API that allows the two separate AMGA APIs, SELECT and UPDATE needed to take a task, to be invoked at once. Our preliminary tests show that the new API allows about 50 tasks to be retrieved per second in contrast with one task per second being retrieved using the two separate SELECT and UPDATE API calls. We addressed performance issues on the use of AMGA in the WISDOM environment and presented some new techniques to drastically improve the performance of AMGA. The techniques are expected to be integrated in the new release of WISDOM environment, being deployed in the EGEE biomed VO infrastructure for the next WISDOM data challenge.oai:cds.cern.ch:11233382008
spellingShingle Computing and Computers
Ahn, Sunil
Kim, Namgyu
Lee, Seehoon
Hwang, Soonwook
Breton, Vincent
Koblitz, Birger
Performance analysis and optimization of AMGA for the WISDOM environment
title Performance analysis and optimization of AMGA for the WISDOM environment
title_full Performance analysis and optimization of AMGA for the WISDOM environment
title_fullStr Performance analysis and optimization of AMGA for the WISDOM environment
title_full_unstemmed Performance analysis and optimization of AMGA for the WISDOM environment
title_short Performance analysis and optimization of AMGA for the WISDOM environment
title_sort performance analysis and optimization of amga for the wisdom environment
topic Computing and Computers
url http://cds.cern.ch/record/1123338
work_keys_str_mv AT ahnsunil performanceanalysisandoptimizationofamgaforthewisdomenvironment
AT kimnamgyu performanceanalysisandoptimizationofamgaforthewisdomenvironment
AT leeseehoon performanceanalysisandoptimizationofamgaforthewisdomenvironment
AT hwangsoonwook performanceanalysisandoptimizationofamgaforthewisdomenvironment
AT bretonvincent performanceanalysisandoptimizationofamgaforthewisdomenvironment
AT koblitzbirger performanceanalysisandoptimizationofamgaforthewisdomenvironment