Cargando…

Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19

BACKGROUND: Non-Coronavirus disease 2019 (COVID-19) pneumonia and COVID-19 have similar clinical features but last for different periods, and consequently, require different treatment protocols. Therefore, they must be differentially diagnosed. This study uses artificial intelligence (AI) to classif...

Descripción completa

Detalles Bibliográficos
Autores principales: Baik, Seung Min, Hong, Kyung Sook, Park, Dong Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Author(s). Published by Elsevier Inc. on behalf of The Canadian Society of Clinical Chemists. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197431/
https://www.ncbi.nlm.nih.gov/pubmed/37211061
http://dx.doi.org/10.1016/j.clinbiochem.2023.05.003
_version_ 1785044550531678208
author Baik, Seung Min
Hong, Kyung Sook
Park, Dong Jin
author_facet Baik, Seung Min
Hong, Kyung Sook
Park, Dong Jin
author_sort Baik, Seung Min
collection PubMed
description BACKGROUND: Non-Coronavirus disease 2019 (COVID-19) pneumonia and COVID-19 have similar clinical features but last for different periods, and consequently, require different treatment protocols. Therefore, they must be differentially diagnosed. This study uses artificial intelligence (AI) to classify the two forms of pneumonia using mainly laboratory test data. METHODS: Various AI models are applied, including boosting models known for deftly solving classification problems. In addition, important features that affect the classification prediction performance are identified using the feature importance technique and SHapley Additive exPlanations method. Despite the data imbalance, the developed model exhibits robust performance. RESULTS: eXtreme gradient boosting, category boosting, and light gradient boosted machine yield an area under the receiver operating characteristic of 0.99 or more, accuracy of 0.96–0.97, and F1-score of 0.96–0.97. In addition, D-dimer, eosinophil, glucose, aspartate aminotransferase, and basophil, which are rather nonspecific laboratory test results, are demonstrated to be important features in differentiating the two disease groups. CONCLUSIONS: The boosting model, which excels in producing classification models using categorical data, excels in developing classification models using linear numerical data, such as laboratory tests. Finally, the proposed model can be applied in various fields to solve classification problems.
format Online
Article
Text
id pubmed-10197431
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Author(s). Published by Elsevier Inc. on behalf of The Canadian Society of Clinical Chemists.
record_format MEDLINE/PubMed
spelling pubmed-101974312023-05-19 Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19 Baik, Seung Min Hong, Kyung Sook Park, Dong Jin Clin Biochem Article BACKGROUND: Non-Coronavirus disease 2019 (COVID-19) pneumonia and COVID-19 have similar clinical features but last for different periods, and consequently, require different treatment protocols. Therefore, they must be differentially diagnosed. This study uses artificial intelligence (AI) to classify the two forms of pneumonia using mainly laboratory test data. METHODS: Various AI models are applied, including boosting models known for deftly solving classification problems. In addition, important features that affect the classification prediction performance are identified using the feature importance technique and SHapley Additive exPlanations method. Despite the data imbalance, the developed model exhibits robust performance. RESULTS: eXtreme gradient boosting, category boosting, and light gradient boosted machine yield an area under the receiver operating characteristic of 0.99 or more, accuracy of 0.96–0.97, and F1-score of 0.96–0.97. In addition, D-dimer, eosinophil, glucose, aspartate aminotransferase, and basophil, which are rather nonspecific laboratory test results, are demonstrated to be important features in differentiating the two disease groups. CONCLUSIONS: The boosting model, which excels in producing classification models using categorical data, excels in developing classification models using linear numerical data, such as laboratory tests. Finally, the proposed model can be applied in various fields to solve classification problems. The Author(s). Published by Elsevier Inc. on behalf of The Canadian Society of Clinical Chemists. 2023-08 2023-05-19 /pmc/articles/PMC10197431/ /pubmed/37211061 http://dx.doi.org/10.1016/j.clinbiochem.2023.05.003 Text en © 2023 The Author(s) Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Baik, Seung Min
Hong, Kyung Sook
Park, Dong Jin
Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title_full Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title_fullStr Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title_full_unstemmed Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title_short Application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-COVID-19 pneumonia and COVID-19
title_sort application and utility of boosting machine learning model based on laboratory test in the differential diagnosis of non-covid-19 pneumonia and covid-19
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10197431/
https://www.ncbi.nlm.nih.gov/pubmed/37211061
http://dx.doi.org/10.1016/j.clinbiochem.2023.05.003
work_keys_str_mv AT baikseungmin applicationandutilityofboostingmachinelearningmodelbasedonlaboratorytestinthedifferentialdiagnosisofnoncovid19pneumoniaandcovid19
AT hongkyungsook applicationandutilityofboostingmachinelearningmodelbasedonlaboratorytestinthedifferentialdiagnosisofnoncovid19pneumoniaandcovid19
AT parkdongjin applicationandutilityofboostingmachinelearningmodelbasedonlaboratorytestinthedifferentialdiagnosisofnoncovid19pneumoniaandcovid19