Cargando…

Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study

BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise fe...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Jia-Ruei, Chen, Chun-Hsien, Huang, Tsung-Wei, Lu, Jang-Jih, Chung, Chia-Ru, Lin, Ting-Wei, Wu, Min-Hsien, Tseng, Yi-Ju, Wang, Hsin-Yao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826151/
https://www.ncbi.nlm.nih.gov/pubmed/35076405
http://dx.doi.org/10.2196/28036
_version_ 1784647372609945600
author Yu, Jia-Ruei
Chen, Chun-Hsien
Huang, Tsung-Wei
Lu, Jang-Jih
Chung, Chia-Ru
Lin, Ting-Wei
Wu, Min-Hsien
Tseng, Yi-Ju
Wang, Hsin-Yao
author_facet Yu, Jia-Ruei
Chen, Chun-Hsien
Huang, Tsung-Wei
Lu, Jang-Jih
Chung, Chia-Ru
Lin, Ting-Wei
Wu, Min-Hsien
Tseng, Yi-Ju
Wang, Hsin-Yao
author_sort Yu, Jia-Ruei
collection PubMed
description BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiency levels of different AI models used for medical applications have not been studied. OBJECTIVE: The aim of this study was to explore and compare the energy efficiency levels of commonly used machine learning algorithms—logistic regression (LR), k-nearest neighbor, support vector machine, random forest (RF), and extreme gradient boosting (XGB) algorithms, as well as four different variants of neural network (NN) algorithms—when applied to clinical laboratory datasets. METHODS: We applied the aforementioned algorithms to two distinct clinical laboratory data sets: a mass spectrometry data set regarding Staphylococcus aureus for predicting methicillin resistance (3338 cases; 268 features) and a urinalysis data set for predicting Trichomonas vaginalis infection (839,164 cases; 9 features). We compared the performance of the nine inference algorithms in terms of accuracy, area under the receiver operating characteristic curve (AUROC), time consumption, and power consumption. The time and power consumption levels were determined using performance counter data from Intel Power Gadget 3.5. RESULTS: The experimental results indicated that the RF and XGB algorithms achieved the two highest AUROC values for both data sets (84.7% and 83.9%, respectively, for the mass spectrometry data set; 91.1% and 91.4%, respectively, for the urinalysis data set). The XGB and LR algorithms exhibited the shortest inference time for both data sets (0.47 milliseconds for both in the mass spectrometry data set; 0.39 and 0.47 milliseconds, respectively, for the urinalysis data set). Compared with the RF algorithm, the XGB and LR algorithms exhibited a 45% and 53%-60% reduction in inference time for the mass spectrometry and urinalysis data sets, respectively. In terms of energy efficiency, the XGB algorithm exhibited the lowest power consumption for the mass spectrometry data set (9.42 Watts) and the LR algorithm exhibited the lowest power consumption for the urinalysis data set (9.98 Watts). Compared with a five-hidden-layer NN, the XGB and LR algorithms achieved 16%-24% and 9%-13% lower power consumption levels for the mass spectrometry and urinalysis data sets, respectively. In all experiments, the XGB algorithm exhibited the best performance in terms of accuracy, run time, and energy efficiency. CONCLUSIONS: The XGB algorithm achieved balanced performance levels in terms of AUROC, run time, and energy efficiency for the two clinical laboratory data sets. Considering the energy constraints in real-world scenarios, the XGB algorithm is ideal for medical AI applications.
format Online
Article
Text
id pubmed-8826151
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-88261512022-02-11 Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study Yu, Jia-Ruei Chen, Chun-Hsien Huang, Tsung-Wei Lu, Jang-Jih Chung, Chia-Ru Lin, Ting-Wei Wu, Min-Hsien Tseng, Yi-Ju Wang, Hsin-Yao J Med Internet Res Original Paper BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiency levels of different AI models used for medical applications have not been studied. OBJECTIVE: The aim of this study was to explore and compare the energy efficiency levels of commonly used machine learning algorithms—logistic regression (LR), k-nearest neighbor, support vector machine, random forest (RF), and extreme gradient boosting (XGB) algorithms, as well as four different variants of neural network (NN) algorithms—when applied to clinical laboratory datasets. METHODS: We applied the aforementioned algorithms to two distinct clinical laboratory data sets: a mass spectrometry data set regarding Staphylococcus aureus for predicting methicillin resistance (3338 cases; 268 features) and a urinalysis data set for predicting Trichomonas vaginalis infection (839,164 cases; 9 features). We compared the performance of the nine inference algorithms in terms of accuracy, area under the receiver operating characteristic curve (AUROC), time consumption, and power consumption. The time and power consumption levels were determined using performance counter data from Intel Power Gadget 3.5. RESULTS: The experimental results indicated that the RF and XGB algorithms achieved the two highest AUROC values for both data sets (84.7% and 83.9%, respectively, for the mass spectrometry data set; 91.1% and 91.4%, respectively, for the urinalysis data set). The XGB and LR algorithms exhibited the shortest inference time for both data sets (0.47 milliseconds for both in the mass spectrometry data set; 0.39 and 0.47 milliseconds, respectively, for the urinalysis data set). Compared with the RF algorithm, the XGB and LR algorithms exhibited a 45% and 53%-60% reduction in inference time for the mass spectrometry and urinalysis data sets, respectively. In terms of energy efficiency, the XGB algorithm exhibited the lowest power consumption for the mass spectrometry data set (9.42 Watts) and the LR algorithm exhibited the lowest power consumption for the urinalysis data set (9.98 Watts). Compared with a five-hidden-layer NN, the XGB and LR algorithms achieved 16%-24% and 9%-13% lower power consumption levels for the mass spectrometry and urinalysis data sets, respectively. In all experiments, the XGB algorithm exhibited the best performance in terms of accuracy, run time, and energy efficiency. CONCLUSIONS: The XGB algorithm achieved balanced performance levels in terms of AUROC, run time, and energy efficiency for the two clinical laboratory data sets. Considering the energy constraints in real-world scenarios, the XGB algorithm is ideal for medical AI applications. JMIR Publications 2022-01-25 /pmc/articles/PMC8826151/ /pubmed/35076405 http://dx.doi.org/10.2196/28036 Text en ©Jia-Ruei Yu, Chun-Hsien Chen, Tsung-Wei Huang, Jang-Jih Lu, Chia-Ru Chung, Ting-Wei Lin, Min-Hsien Wu, Yi-Ju Tseng, Hsin-Yao Wang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 25.01.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Yu, Jia-Ruei
Chen, Chun-Hsien
Huang, Tsung-Wei
Lu, Jang-Jih
Chung, Chia-Ru
Lin, Ting-Wei
Wu, Min-Hsien
Tseng, Yi-Ju
Wang, Hsin-Yao
Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title_full Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title_fullStr Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title_full_unstemmed Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title_short Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
title_sort energy efficiency of inference algorithms for clinical laboratory data sets: green artificial intelligence study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826151/
https://www.ncbi.nlm.nih.gov/pubmed/35076405
http://dx.doi.org/10.2196/28036
work_keys_str_mv AT yujiaruei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT chenchunhsien energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT huangtsungwei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT lujangjih energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT chungchiaru energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT lintingwei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT wuminhsien energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT tsengyiju energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy
AT wanghsinyao energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy