Cargando…

Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets

In the epidemiological COVID-19 research, artificial intelligence is a unique approach to make predictions about disease severity to manage COVID-19 patients. A limitation of artificial intelligence is, however, the high risk of bias. We investigated the skill of data mining and machine learning, tw...

Descripción completa

Detalles Bibliográficos
Autores principales: Pulgar-Sánchez, Mary, Chamorro, Kevin, Fors, Martha, Mora, Francisco X., Ramírez, Hégira, Fernandez-Moreira, Esteban, Ballaz, Santiago J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349478/
https://www.ncbi.nlm.nih.gov/pubmed/34391001
http://dx.doi.org/10.1016/j.compbiomed.2021.104738
_version_ 1783735571879821312
author Pulgar-Sánchez, Mary
Chamorro, Kevin
Fors, Martha
Mora, Francisco X.
Ramírez, Hégira
Fernandez-Moreira, Esteban
Ballaz, Santiago J.
author_facet Pulgar-Sánchez, Mary
Chamorro, Kevin
Fors, Martha
Mora, Francisco X.
Ramírez, Hégira
Fernandez-Moreira, Esteban
Ballaz, Santiago J.
author_sort Pulgar-Sánchez, Mary
collection PubMed
description In the epidemiological COVID-19 research, artificial intelligence is a unique approach to make predictions about disease severity to manage COVID-19 patients. A limitation of artificial intelligence is, however, the high risk of bias. We investigated the skill of data mining and machine learning, two advanced forms of artificial intelligence, to predict severe COVID-19 pneumonia based on routine laboratory tests. A sample of 4009 COVID-19 patients was divided into Severe (PaO(2)< 60 mmHg, 489 cases) and Non-Severe (PaO(2) ≥ 60 mmHg, 3520 cases) groups according to blood hypoxemia on admission and their laboratory datasets analyzed by the R software and WEKA workbench. After curation, data were processed for the selection of the most influential features including hemogram, pCO(2), blood acid-base balance, prothrombin time, inflammation biomarkers, and glucose. The best fit of variables was successfully confirmed by either the Multilayer Perceptron, a feedforward neural network algorithm that performed machine recognition of severe COVID-19 with 96.5% precision, or by the C4.5 software, a supervised learning algorithm based on an objective-predefined variable (severity) that generated a decision tree with 89.4% precision. Finally, a complex bivariate Pearson's correlation matrix combined with advanced hierarchical clustering (dendrograms) were conducted for knowledge discovery. The hidden structure of the datasets revealed shift patterns related to the development of COVID-19-induced pneumonia that involved the lymphocyte-to-C-reactive protein and leukocyte-to-C-protein ratios, neutrophil %, pH and pCO(2). The data mining approaches to the hematological fluctuations associated with severe COVID-19 pneumonia could not only anticipate adverse clinical outcomes, but also reveal putative therapeutic targets.
format Online
Article
Text
id pubmed-8349478
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-83494782021-08-09 Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets Pulgar-Sánchez, Mary Chamorro, Kevin Fors, Martha Mora, Francisco X. Ramírez, Hégira Fernandez-Moreira, Esteban Ballaz, Santiago J. Comput Biol Med Article In the epidemiological COVID-19 research, artificial intelligence is a unique approach to make predictions about disease severity to manage COVID-19 patients. A limitation of artificial intelligence is, however, the high risk of bias. We investigated the skill of data mining and machine learning, two advanced forms of artificial intelligence, to predict severe COVID-19 pneumonia based on routine laboratory tests. A sample of 4009 COVID-19 patients was divided into Severe (PaO(2)< 60 mmHg, 489 cases) and Non-Severe (PaO(2) ≥ 60 mmHg, 3520 cases) groups according to blood hypoxemia on admission and their laboratory datasets analyzed by the R software and WEKA workbench. After curation, data were processed for the selection of the most influential features including hemogram, pCO(2), blood acid-base balance, prothrombin time, inflammation biomarkers, and glucose. The best fit of variables was successfully confirmed by either the Multilayer Perceptron, a feedforward neural network algorithm that performed machine recognition of severe COVID-19 with 96.5% precision, or by the C4.5 software, a supervised learning algorithm based on an objective-predefined variable (severity) that generated a decision tree with 89.4% precision. Finally, a complex bivariate Pearson's correlation matrix combined with advanced hierarchical clustering (dendrograms) were conducted for knowledge discovery. The hidden structure of the datasets revealed shift patterns related to the development of COVID-19-induced pneumonia that involved the lymphocyte-to-C-reactive protein and leukocyte-to-C-protein ratios, neutrophil %, pH and pCO(2). The data mining approaches to the hematological fluctuations associated with severe COVID-19 pneumonia could not only anticipate adverse clinical outcomes, but also reveal putative therapeutic targets. Elsevier Ltd. 2021-09 2021-08-08 /pmc/articles/PMC8349478/ /pubmed/34391001 http://dx.doi.org/10.1016/j.compbiomed.2021.104738 Text en © 2021 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Pulgar-Sánchez, Mary
Chamorro, Kevin
Fors, Martha
Mora, Francisco X.
Ramírez, Hégira
Fernandez-Moreira, Esteban
Ballaz, Santiago J.
Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title_full Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title_fullStr Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title_full_unstemmed Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title_short Biomarkers of severe COVID-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
title_sort biomarkers of severe covid-19 pneumonia on admission using data-mining powered by common laboratory blood tests-datasets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8349478/
https://www.ncbi.nlm.nih.gov/pubmed/34391001
http://dx.doi.org/10.1016/j.compbiomed.2021.104738
work_keys_str_mv AT pulgarsanchezmary biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT chamorrokevin biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT forsmartha biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT morafranciscox biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT ramirezhegira biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT fernandezmoreiraesteban biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets
AT ballazsantiagoj biomarkersofseverecovid19pneumoniaonadmissionusingdataminingpoweredbycommonlaboratorybloodtestsdatasets