Cargando…
Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study
BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise fe...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826151/ https://www.ncbi.nlm.nih.gov/pubmed/35076405 http://dx.doi.org/10.2196/28036 |
_version_ | 1784647372609945600 |
---|---|
author | Yu, Jia-Ruei Chen, Chun-Hsien Huang, Tsung-Wei Lu, Jang-Jih Chung, Chia-Ru Lin, Ting-Wei Wu, Min-Hsien Tseng, Yi-Ju Wang, Hsin-Yao |
author_facet | Yu, Jia-Ruei Chen, Chun-Hsien Huang, Tsung-Wei Lu, Jang-Jih Chung, Chia-Ru Lin, Ting-Wei Wu, Min-Hsien Tseng, Yi-Ju Wang, Hsin-Yao |
author_sort | Yu, Jia-Ruei |
collection | PubMed |
description | BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiency levels of different AI models used for medical applications have not been studied. OBJECTIVE: The aim of this study was to explore and compare the energy efficiency levels of commonly used machine learning algorithms—logistic regression (LR), k-nearest neighbor, support vector machine, random forest (RF), and extreme gradient boosting (XGB) algorithms, as well as four different variants of neural network (NN) algorithms—when applied to clinical laboratory datasets. METHODS: We applied the aforementioned algorithms to two distinct clinical laboratory data sets: a mass spectrometry data set regarding Staphylococcus aureus for predicting methicillin resistance (3338 cases; 268 features) and a urinalysis data set for predicting Trichomonas vaginalis infection (839,164 cases; 9 features). We compared the performance of the nine inference algorithms in terms of accuracy, area under the receiver operating characteristic curve (AUROC), time consumption, and power consumption. The time and power consumption levels were determined using performance counter data from Intel Power Gadget 3.5. RESULTS: The experimental results indicated that the RF and XGB algorithms achieved the two highest AUROC values for both data sets (84.7% and 83.9%, respectively, for the mass spectrometry data set; 91.1% and 91.4%, respectively, for the urinalysis data set). The XGB and LR algorithms exhibited the shortest inference time for both data sets (0.47 milliseconds for both in the mass spectrometry data set; 0.39 and 0.47 milliseconds, respectively, for the urinalysis data set). Compared with the RF algorithm, the XGB and LR algorithms exhibited a 45% and 53%-60% reduction in inference time for the mass spectrometry and urinalysis data sets, respectively. In terms of energy efficiency, the XGB algorithm exhibited the lowest power consumption for the mass spectrometry data set (9.42 Watts) and the LR algorithm exhibited the lowest power consumption for the urinalysis data set (9.98 Watts). Compared with a five-hidden-layer NN, the XGB and LR algorithms achieved 16%-24% and 9%-13% lower power consumption levels for the mass spectrometry and urinalysis data sets, respectively. In all experiments, the XGB algorithm exhibited the best performance in terms of accuracy, run time, and energy efficiency. CONCLUSIONS: The XGB algorithm achieved balanced performance levels in terms of AUROC, run time, and energy efficiency for the two clinical laboratory data sets. Considering the energy constraints in real-world scenarios, the XGB algorithm is ideal for medical AI applications. |
format | Online Article Text |
id | pubmed-8826151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-88261512022-02-11 Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study Yu, Jia-Ruei Chen, Chun-Hsien Huang, Tsung-Wei Lu, Jang-Jih Chung, Chia-Ru Lin, Ting-Wei Wu, Min-Hsien Tseng, Yi-Ju Wang, Hsin-Yao J Med Internet Res Original Paper BACKGROUND: The use of artificial intelligence (AI) in the medical domain has attracted considerable research interest. Inference applications in the medical domain require energy-efficient AI models. In contrast to other types of data in visual AI, data from medical laboratories usually comprise features with strong signals. Numerous energy optimization techniques have been developed to relieve the burden on the hardware required to deploy a complex learning model. However, the energy efficiency levels of different AI models used for medical applications have not been studied. OBJECTIVE: The aim of this study was to explore and compare the energy efficiency levels of commonly used machine learning algorithms—logistic regression (LR), k-nearest neighbor, support vector machine, random forest (RF), and extreme gradient boosting (XGB) algorithms, as well as four different variants of neural network (NN) algorithms—when applied to clinical laboratory datasets. METHODS: We applied the aforementioned algorithms to two distinct clinical laboratory data sets: a mass spectrometry data set regarding Staphylococcus aureus for predicting methicillin resistance (3338 cases; 268 features) and a urinalysis data set for predicting Trichomonas vaginalis infection (839,164 cases; 9 features). We compared the performance of the nine inference algorithms in terms of accuracy, area under the receiver operating characteristic curve (AUROC), time consumption, and power consumption. The time and power consumption levels were determined using performance counter data from Intel Power Gadget 3.5. RESULTS: The experimental results indicated that the RF and XGB algorithms achieved the two highest AUROC values for both data sets (84.7% and 83.9%, respectively, for the mass spectrometry data set; 91.1% and 91.4%, respectively, for the urinalysis data set). The XGB and LR algorithms exhibited the shortest inference time for both data sets (0.47 milliseconds for both in the mass spectrometry data set; 0.39 and 0.47 milliseconds, respectively, for the urinalysis data set). Compared with the RF algorithm, the XGB and LR algorithms exhibited a 45% and 53%-60% reduction in inference time for the mass spectrometry and urinalysis data sets, respectively. In terms of energy efficiency, the XGB algorithm exhibited the lowest power consumption for the mass spectrometry data set (9.42 Watts) and the LR algorithm exhibited the lowest power consumption for the urinalysis data set (9.98 Watts). Compared with a five-hidden-layer NN, the XGB and LR algorithms achieved 16%-24% and 9%-13% lower power consumption levels for the mass spectrometry and urinalysis data sets, respectively. In all experiments, the XGB algorithm exhibited the best performance in terms of accuracy, run time, and energy efficiency. CONCLUSIONS: The XGB algorithm achieved balanced performance levels in terms of AUROC, run time, and energy efficiency for the two clinical laboratory data sets. Considering the energy constraints in real-world scenarios, the XGB algorithm is ideal for medical AI applications. JMIR Publications 2022-01-25 /pmc/articles/PMC8826151/ /pubmed/35076405 http://dx.doi.org/10.2196/28036 Text en ©Jia-Ruei Yu, Chun-Hsien Chen, Tsung-Wei Huang, Jang-Jih Lu, Chia-Ru Chung, Ting-Wei Lin, Min-Hsien Wu, Yi-Ju Tseng, Hsin-Yao Wang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 25.01.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Yu, Jia-Ruei Chen, Chun-Hsien Huang, Tsung-Wei Lu, Jang-Jih Chung, Chia-Ru Lin, Ting-Wei Wu, Min-Hsien Tseng, Yi-Ju Wang, Hsin-Yao Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title | Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title_full | Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title_fullStr | Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title_full_unstemmed | Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title_short | Energy Efficiency of Inference Algorithms for Clinical Laboratory Data Sets: Green Artificial Intelligence Study |
title_sort | energy efficiency of inference algorithms for clinical laboratory data sets: green artificial intelligence study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826151/ https://www.ncbi.nlm.nih.gov/pubmed/35076405 http://dx.doi.org/10.2196/28036 |
work_keys_str_mv | AT yujiaruei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT chenchunhsien energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT huangtsungwei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT lujangjih energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT chungchiaru energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT lintingwei energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT wuminhsien energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT tsengyiju energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy AT wanghsinyao energyefficiencyofinferencealgorithmsforclinicallaboratorydatasetsgreenartificialintelligencestudy |