Cargando…

Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data

Accurate detection is still a challenge in machine learning (ML) for Alzheimer’s disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperpara...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Fan, Petersen, Melissa, Johnson, Leigh, Hall, James, O’Bryant, Sid E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9662287/
https://www.ncbi.nlm.nih.gov/pubmed/36381541
http://dx.doi.org/10.3390/app12136670
_version_ 1784830654975836160
author Zhang, Fan
Petersen, Melissa
Johnson, Leigh
Hall, James
O’Bryant, Sid E.
author_facet Zhang, Fan
Petersen, Melissa
Johnson, Leigh
Hall, James
O’Bryant, Sid E.
author_sort Zhang, Fan
collection PubMed
description Accurate detection is still a challenge in machine learning (ML) for Alzheimer’s disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to hyperparameter tuning of gamma, cost, and class weight using a support vector machine (SVM) model with 10 times repeated fivefold cross-validation. We executed the hyperparameter tuning workflow with R’s bigmemory, foreach, and doParallel packages on Texas Advanced Computing Center (TACC)’s Lonestar6 system. The computational time was dramatically reduced by up to 98.2% for the high-performance SVM hyperparameter tuning model, and the performance of cross-validation was also improved (the positive predictive value and the negative predictive value at base rate 12% were, respectively, 16.42% and 92.72%). Our results show that a single-node multicore parallel structure and high-performance SVM hyperparameter tuning model can deliver efficient and fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications.
format Online
Article
Text
id pubmed-9662287
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-96622872022-11-14 Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data Zhang, Fan Petersen, Melissa Johnson, Leigh Hall, James O’Bryant, Sid E. Appl Sci (Basel) Article Accurate detection is still a challenge in machine learning (ML) for Alzheimer’s disease (AD). Class imbalance in imbalanced AD data is another big challenge for machine-learning algorithms working under the assumption that the data are evenly distributed within classes. Here, we present a hyperparameter tuning workflow with high-performance computing (HPC) for imbalanced data related to prevalent mild cognitive impairment (MCI) and AD in the Health and Aging Brain Study-Health Disparities (HABS-HD) project. We applied a single-node multicore parallel mode to hyperparameter tuning of gamma, cost, and class weight using a support vector machine (SVM) model with 10 times repeated fivefold cross-validation. We executed the hyperparameter tuning workflow with R’s bigmemory, foreach, and doParallel packages on Texas Advanced Computing Center (TACC)’s Lonestar6 system. The computational time was dramatically reduced by up to 98.2% for the high-performance SVM hyperparameter tuning model, and the performance of cross-validation was also improved (the positive predictive value and the negative predictive value at base rate 12% were, respectively, 16.42% and 92.72%). Our results show that a single-node multicore parallel structure and high-performance SVM hyperparameter tuning model can deliver efficient and fast computation and achieve outstanding agility, simplicity, and productivity for imbalanced data in AD applications. 2022-07-01 /pmc/articles/PMC9662287/ /pubmed/36381541 http://dx.doi.org/10.3390/app12136670 Text en https://creativecommons.org/licenses/by/4.0/This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Fan
Petersen, Melissa
Johnson, Leigh
Hall, James
O’Bryant, Sid E.
Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title_full Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title_fullStr Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title_full_unstemmed Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title_short Hyperparameter Tuning with High Performance Computing Machine Learning for Imbalanced Alzheimer’s Disease Data
title_sort hyperparameter tuning with high performance computing machine learning for imbalanced alzheimer’s disease data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9662287/
https://www.ncbi.nlm.nih.gov/pubmed/36381541
http://dx.doi.org/10.3390/app12136670
work_keys_str_mv AT zhangfan hyperparametertuningwithhighperformancecomputingmachinelearningforimbalancedalzheimersdiseasedata
AT petersenmelissa hyperparametertuningwithhighperformancecomputingmachinelearningforimbalancedalzheimersdiseasedata
AT johnsonleigh hyperparametertuningwithhighperformancecomputingmachinelearningforimbalancedalzheimersdiseasedata
AT halljames hyperparametertuningwithhighperformancecomputingmachinelearningforimbalancedalzheimersdiseasedata
AT obryantside hyperparametertuningwithhighperformancecomputingmachinelearningforimbalancedalzheimersdiseasedata