Cargando…

Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection

The most commonly occurring cancer among women, breast cancer, causes lakhs of deaths annually, which can be prevented by early detection and treatment. Detection can be done by using machine learning models on histopathological images which are affordable, reliable, and accurate. Previous studies i...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Priya, Gupta, Swayam, Gupta, Vasu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer India 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255948/
http://dx.doi.org/10.1007/s13198-023-01955-8
_version_ 1785056996035133440
author Singh, Priya
Gupta, Swayam
Gupta, Vasu
author_facet Singh, Priya
Gupta, Swayam
Gupta, Vasu
author_sort Singh, Priya
collection PubMed
description The most commonly occurring cancer among women, breast cancer, causes lakhs of deaths annually, which can be prevented by early detection and treatment. Detection can be done by using machine learning models on histopathological images which are affordable, reliable, and accurate. Previous studies in this regard have focused on transfer learning methods combining feature selection using Convolutional Neural Networks (CNNs) and an ensemble of gradient-boosting algorithms. However, none of the state-of-the-art techniques capture the multi-objective nature of Breast Cancer Detection (BCD) and tend to improve a single performance measure such as Accuracy and F1 score, which fail to capture certain essential aspects of the problem as the cost of misclassification varies greatly depending on its type. In this study, a multi-objective hyperparameter optimization technique for Breast Cancer Prediction is proposed by comparing random search, Non-Dominated Sorting Genetic Algorithm (NSGA-II) and Bayesian optimization. This approach is applied to an ensemble of three popular gradient-boosting techniques: extreme gradient-boosting, light gradient-boosting machine and categorical boosting on features obtained from Inception-ResNet-v2 CNN model applied on the benchmark BreakHis dataset to optimize Precision, Recall, Accuracy, and AUC simultaneously. The novel NSGA2-IRv2-CXL model proposed in this study achieves maximum Accuracy of 94.40%, AUC of 98.16, Precision of 95.77%, and Recall of 99.29% for 100[Formula: see text] magnification. The study also establishes trade-offs between performance metrics thereby opening avenues for further research in multi-objective approaches to BCD which can provide a larger view of the strengths and weaknesses of the classification model.
format Online
Article
Text
id pubmed-10255948
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer India
record_format MEDLINE/PubMed
spelling pubmed-102559482023-06-12 Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection Singh, Priya Gupta, Swayam Gupta, Vasu Int J Syst Assur Eng Manag Original Article The most commonly occurring cancer among women, breast cancer, causes lakhs of deaths annually, which can be prevented by early detection and treatment. Detection can be done by using machine learning models on histopathological images which are affordable, reliable, and accurate. Previous studies in this regard have focused on transfer learning methods combining feature selection using Convolutional Neural Networks (CNNs) and an ensemble of gradient-boosting algorithms. However, none of the state-of-the-art techniques capture the multi-objective nature of Breast Cancer Detection (BCD) and tend to improve a single performance measure such as Accuracy and F1 score, which fail to capture certain essential aspects of the problem as the cost of misclassification varies greatly depending on its type. In this study, a multi-objective hyperparameter optimization technique for Breast Cancer Prediction is proposed by comparing random search, Non-Dominated Sorting Genetic Algorithm (NSGA-II) and Bayesian optimization. This approach is applied to an ensemble of three popular gradient-boosting techniques: extreme gradient-boosting, light gradient-boosting machine and categorical boosting on features obtained from Inception-ResNet-v2 CNN model applied on the benchmark BreakHis dataset to optimize Precision, Recall, Accuracy, and AUC simultaneously. The novel NSGA2-IRv2-CXL model proposed in this study achieves maximum Accuracy of 94.40%, AUC of 98.16, Precision of 95.77%, and Recall of 99.29% for 100[Formula: see text] magnification. The study also establishes trade-offs between performance metrics thereby opening avenues for further research in multi-objective approaches to BCD which can provide a larger view of the strengths and weaknesses of the classification model. Springer India 2023-06-09 /pmc/articles/PMC10255948/ http://dx.doi.org/10.1007/s13198-023-01955-8 Text en © The Author(s) under exclusive licence to The Society for Reliability Engineering, Quality and Operations Management (SREQOM), India and The Division of Operation and Maintenance, Lulea University of Technology, Sweden 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Singh, Priya
Gupta, Swayam
Gupta, Vasu
Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title_full Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title_fullStr Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title_full_unstemmed Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title_short Multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
title_sort multi-objective hyperparameter optimization on gradient-boosting for breast cancer detection
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10255948/
http://dx.doi.org/10.1007/s13198-023-01955-8
work_keys_str_mv AT singhpriya multiobjectivehyperparameteroptimizationongradientboostingforbreastcancerdetection
AT guptaswayam multiobjectivehyperparameteroptimizationongradientboostingforbreastcancerdetection
AT guptavasu multiobjectivehyperparameteroptimizationongradientboostingforbreastcancerdetection