Cargando…

Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system

<!--HTML--><p>Training and hyperparameter optimization (HPO) of deep learning-based (DL) AI models is often compute resource intensive and calls for the use of large-scale distributed resources as well as scalable and resource efficient hyperparameter search and evaluation algorithms. In...

Descripción completa

Detalles Bibliográficos
Autor principal: Garcia Amboage, Juan Pablo
Lenguaje:eng
Publicado: 2023
Materias:
Acceso en línea:http://cds.cern.ch/record/2865381
_version_ 1780978039981604864
author Garcia Amboage, Juan Pablo
author_facet Garcia Amboage, Juan Pablo
author_sort Garcia Amboage, Juan Pablo
collection CERN
description <!--HTML--><p>Training and hyperparameter optimization (HPO) of deep learning-based (DL) AI models is often compute resource intensive and calls for the use of large-scale distributed resources as well as scalable and resource efficient hyperparameter search and evaluation algorithms. In this context, performance prediction emerges as a potential approach to accelerate the HPO process.</p><p>Using meta-models, referred to as performance predictors, it is possible to estimate the performance of a given configuration at a particular epoch by leveraging information from its partial learning curve. By employing performance prediction, it is possible to prioritize the training of the most promising configurations based on their predicted performance, while avoiding the need to fully train configurations with worse predicted performance. Consequently, this approach holds great potential for reducing the time and computational resources required during the HPO process.</p><p>In the context of the CoE RAISE project, this work explores novel techniques based on performance prediction to accelerate the HPO process of the particle flow reconstruction DL model known as MLPF and other NN architectures leveraging the use of High Performance Computing (HPC) resources for training the target model and the quantum annealer at JSC for training the performance predictors. Moreover, a new HPO algorithm called Swift-Hyperband, which integrates the existing Hyperband algorithm with the use of performance predictors, is proposed.</p><p><strong>About the speaker</strong>&nbsp;<br><span style="color:hsl(210,75%,60%);"><strong>Juan Pablo García</strong></span> is enrolled in the final year of a 6-year double degree program in Computer Science and Mathematics at the University of Santiago de Compostela (USC), Spain. He is currently working on his Computer Science thesis at CERN in the context of the EU funded CoE RAISE project. Juan Pablo is particularly interested in the fields where Mathematics and Computer Science intersect. Therefore, within Computer Science, he is interested in machine intelligence, hyperparameter optimization, and the theory of computer science, among many other things. In Mathematics, he is especially curious about topology, operations research, and optimization.</p>
id cern-2865381
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2023
record_format invenio
spelling cern-28653812023-07-18T19:59:02Zhttp://cds.cern.ch/record/2865381engGarcia Amboage, Juan PabloAccelerating hyperparameter optimization using performance prediction on a heterogeneous HPC systemAccelerating hyperparameter optimization using performance prediction on a heterogeneous HPC systemCERN Computing Seminar | openlab series<!--HTML--><p>Training and hyperparameter optimization (HPO) of deep learning-based (DL) AI models is often compute resource intensive and calls for the use of large-scale distributed resources as well as scalable and resource efficient hyperparameter search and evaluation algorithms. In this context, performance prediction emerges as a potential approach to accelerate the HPO process.</p><p>Using meta-models, referred to as performance predictors, it is possible to estimate the performance of a given configuration at a particular epoch by leveraging information from its partial learning curve. By employing performance prediction, it is possible to prioritize the training of the most promising configurations based on their predicted performance, while avoiding the need to fully train configurations with worse predicted performance. Consequently, this approach holds great potential for reducing the time and computational resources required during the HPO process.</p><p>In the context of the CoE RAISE project, this work explores novel techniques based on performance prediction to accelerate the HPO process of the particle flow reconstruction DL model known as MLPF and other NN architectures leveraging the use of High Performance Computing (HPC) resources for training the target model and the quantum annealer at JSC for training the performance predictors. Moreover, a new HPO algorithm called Swift-Hyperband, which integrates the existing Hyperband algorithm with the use of performance predictors, is proposed.</p><p><strong>About the speaker</strong>&nbsp;<br><span style="color:hsl(210,75%,60%);"><strong>Juan Pablo García</strong></span> is enrolled in the final year of a 6-year double degree program in Computer Science and Mathematics at the University of Santiago de Compostela (USC), Spain. He is currently working on his Computer Science thesis at CERN in the context of the EU funded CoE RAISE project. Juan Pablo is particularly interested in the fields where Mathematics and Computer Science intersect. Therefore, within Computer Science, he is interested in machine intelligence, hyperparameter optimization, and the theory of computer science, among many other things. In Mathematics, he is especially curious about topology, operations research, and optimization.</p>oai:cds.cern.ch:28653812023
spellingShingle CERN Computing Seminar | openlab series
Garcia Amboage, Juan Pablo
Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title_full Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title_fullStr Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title_full_unstemmed Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title_short Accelerating hyperparameter optimization using performance prediction on a heterogeneous HPC system
title_sort accelerating hyperparameter optimization using performance prediction on a heterogeneous hpc system
topic CERN Computing Seminar | openlab series
url http://cds.cern.ch/record/2865381
work_keys_str_mv AT garciaamboagejuanpablo acceleratinghyperparameteroptimizationusingperformancepredictiononaheterogeneoushpcsystem