Cargando…

Machine learning-assisted prediction of pneumonia based on non-invasive measures

BACKGROUND: Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases via non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diag...

Descripción completa

Detalles Bibliográficos
Autores principales:	Effah, Clement Yaw, Miao, Ruoqi, Drokow, Emmanuel Kwateng, Agboyibor, Clement, Qiao, Ruiping, Wu, Yongjun, Miao, Lijun, Wang, Yanbin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Public Health
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371749/ https://www.ncbi.nlm.nih.gov/pubmed/35968461 http://dx.doi.org/10.3389/fpubh.2022.938801

_version_	1784767226159562752
author	Effah, Clement Yaw Miao, Ruoqi Drokow, Emmanuel Kwateng Agboyibor, Clement Qiao, Ruiping Wu, Yongjun Miao, Lijun Wang, Yanbin
author_facet	Effah, Clement Yaw Miao, Ruoqi Drokow, Emmanuel Kwateng Agboyibor, Clement Qiao, Ruiping Wu, Yongjun Miao, Lijun Wang, Yanbin
author_sort	Effah, Clement Yaw
collection	PubMed
description	BACKGROUND: Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases via non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diagnostic uncertainty associated with bacterial pneumonia. Herein, this study conducted several experiments using eight machine learning models to predict pneumonia based on biomarkers, laboratory parameters, and physical features. METHODS: We perform machine-learning analysis on 535 different patients, each with 45 features. Data normalization to rescale all real-valued features was performed. Since it is a binary problem, we categorized each patient into one class at a time. We designed three experiments to evaluate the models: (1) feature selection techniques to select appropriate features for the models, (2) experiments on the imbalanced original dataset, and (3) experiments on the SMOTE data. We then compared eight machine learning models to evaluate their effectiveness in predicting pneumonia RESULTS: Biomarkers such as C-reactive protein and procalcitonin demonstrated the most significant discriminating power. Ensemble machine learning models such as RF (accuracy = 92.0%, precision = 91.3%, recall = 96.0%, f1-Score = 93.6%) and XGBoost (accuracy = 90.8%, precision = 92.6%, recall = 92.3%, f1-score = 92.4%) achieved the highest performance accuracy on the original dataset with AUCs of 0.96 and 0.97, respectively. On the SMOTE dataset, RF and XGBoost achieved the highest prediction results with f1-scores of 92.0 and 91.2%, respectively. Also, AUC of 0.97 was achieved for both RF and XGBoost models. CONCLUSIONS: Our models showed that in the diagnosis of pneumonia, individual clinical history, laboratory indicators, and symptoms do not have adequate discriminatory power. We can also conclude that the ensemble ML models performed better in this study.
format	Online Article Text
id	pubmed-9371749
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-93717492022-08-12 Machine learning-assisted prediction of pneumonia based on non-invasive measures Effah, Clement Yaw Miao, Ruoqi Drokow, Emmanuel Kwateng Agboyibor, Clement Qiao, Ruiping Wu, Yongjun Miao, Lijun Wang, Yanbin Front Public Health Public Health BACKGROUND: Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases via non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diagnostic uncertainty associated with bacterial pneumonia. Herein, this study conducted several experiments using eight machine learning models to predict pneumonia based on biomarkers, laboratory parameters, and physical features. METHODS: We perform machine-learning analysis on 535 different patients, each with 45 features. Data normalization to rescale all real-valued features was performed. Since it is a binary problem, we categorized each patient into one class at a time. We designed three experiments to evaluate the models: (1) feature selection techniques to select appropriate features for the models, (2) experiments on the imbalanced original dataset, and (3) experiments on the SMOTE data. We then compared eight machine learning models to evaluate their effectiveness in predicting pneumonia RESULTS: Biomarkers such as C-reactive protein and procalcitonin demonstrated the most significant discriminating power. Ensemble machine learning models such as RF (accuracy = 92.0%, precision = 91.3%, recall = 96.0%, f1-Score = 93.6%) and XGBoost (accuracy = 90.8%, precision = 92.6%, recall = 92.3%, f1-score = 92.4%) achieved the highest performance accuracy on the original dataset with AUCs of 0.96 and 0.97, respectively. On the SMOTE dataset, RF and XGBoost achieved the highest prediction results with f1-scores of 92.0 and 91.2%, respectively. Also, AUC of 0.97 was achieved for both RF and XGBoost models. CONCLUSIONS: Our models showed that in the diagnosis of pneumonia, individual clinical history, laboratory indicators, and symptoms do not have adequate discriminatory power. We can also conclude that the ensemble ML models performed better in this study. Frontiers Media S.A. 2022-07-28 /pmc/articles/PMC9371749/ /pubmed/35968461 http://dx.doi.org/10.3389/fpubh.2022.938801 Text en Copyright © 2022 Effah, Miao, Drokow, Agboyibor, Qiao, Wu, Miao and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Public Health Effah, Clement Yaw Miao, Ruoqi Drokow, Emmanuel Kwateng Agboyibor, Clement Qiao, Ruiping Wu, Yongjun Miao, Lijun Wang, Yanbin Machine learning-assisted prediction of pneumonia based on non-invasive measures
title	Machine learning-assisted prediction of pneumonia based on non-invasive measures
title_full	Machine learning-assisted prediction of pneumonia based on non-invasive measures
title_fullStr	Machine learning-assisted prediction of pneumonia based on non-invasive measures
title_full_unstemmed	Machine learning-assisted prediction of pneumonia based on non-invasive measures
title_short	Machine learning-assisted prediction of pneumonia based on non-invasive measures
title_sort	machine learning-assisted prediction of pneumonia based on non-invasive measures
topic	Public Health
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9371749/ https://www.ncbi.nlm.nih.gov/pubmed/35968461 http://dx.doi.org/10.3389/fpubh.2022.938801
work_keys_str_mv	AT effahclementyaw machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT miaoruoqi machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT drokowemmanuelkwateng machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT agboyiborclement machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT qiaoruiping machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT wuyongjun machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT miaolijun machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures AT wangyanbin machinelearningassistedpredictionofpneumoniabasedonnoninvasivemeasures

Machine learning-assisted prediction of pneumonia based on non-invasive measures

Ejemplares similares