Cargando…

A causal learning framework for the analysis and interpretation of COVID-19 clinical data

We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferrari, Elisa, Gargani, Luna, Barbieri, Greta, Ghiadoni, Lorenzo, Faita, Francesco, Bacciu, Davide
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9119448/
https://www.ncbi.nlm.nih.gov/pubmed/35588440
http://dx.doi.org/10.1371/journal.pone.0268327
_version_ 1784710702065254400
author Ferrari, Elisa
Gargani, Luna
Barbieri, Greta
Ghiadoni, Lorenzo
Faita, Francesco
Bacciu, Davide
author_facet Ferrari, Elisa
Gargani, Luna
Barbieri, Greta
Ghiadoni, Lorenzo
Faita, Francesco
Bacciu, Davide
author_sort Ferrari, Elisa
collection PubMed
description We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph showing the causal connections among the analyzed features. The workflow consists in a multi-step approach that goes from identifying the main causes of patient’s outcome through BSL, to the realization of a tool suitable for clinical practice, based on a Binary Decision Tree (BDT), to recognize patients at high-risk with information available already at hospital admission time. We evaluate our approach on a feature-rich dataset of Coronavirus disease (COVID-19), showing that the proposed framework provides a schematic overview of the multi-factorial processes that jointly contribute to the outcome. We compare our findings with current literature on COVID-19, showing that this approach allows to re-discover established cause-effect relationships about the disease. Further, our approach yields to a highly interpretable tool correctly predicting the outcome of 85% of subjects based exclusively on 3 features: age, a previous history of chronic obstructive pulmonary disease and the PaO2/FiO2 ratio at the time of arrival to the hospital. The inclusion of additional information from 4 routine blood tests (Creatinine, Glucose, pO2 and Sodium) increases predictive accuracy to 94.5%.
format Online
Article
Text
id pubmed-9119448
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91194482022-05-20 A causal learning framework for the analysis and interpretation of COVID-19 clinical data Ferrari, Elisa Gargani, Luna Barbieri, Greta Ghiadoni, Lorenzo Faita, Francesco Bacciu, Davide PLoS One Research Article We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph showing the causal connections among the analyzed features. The workflow consists in a multi-step approach that goes from identifying the main causes of patient’s outcome through BSL, to the realization of a tool suitable for clinical practice, based on a Binary Decision Tree (BDT), to recognize patients at high-risk with information available already at hospital admission time. We evaluate our approach on a feature-rich dataset of Coronavirus disease (COVID-19), showing that the proposed framework provides a schematic overview of the multi-factorial processes that jointly contribute to the outcome. We compare our findings with current literature on COVID-19, showing that this approach allows to re-discover established cause-effect relationships about the disease. Further, our approach yields to a highly interpretable tool correctly predicting the outcome of 85% of subjects based exclusively on 3 features: age, a previous history of chronic obstructive pulmonary disease and the PaO2/FiO2 ratio at the time of arrival to the hospital. The inclusion of additional information from 4 routine blood tests (Creatinine, Glucose, pO2 and Sodium) increases predictive accuracy to 94.5%. Public Library of Science 2022-05-19 /pmc/articles/PMC9119448/ /pubmed/35588440 http://dx.doi.org/10.1371/journal.pone.0268327 Text en © 2022 Ferrari et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ferrari, Elisa
Gargani, Luna
Barbieri, Greta
Ghiadoni, Lorenzo
Faita, Francesco
Bacciu, Davide
A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title_full A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title_fullStr A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title_full_unstemmed A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title_short A causal learning framework for the analysis and interpretation of COVID-19 clinical data
title_sort causal learning framework for the analysis and interpretation of covid-19 clinical data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9119448/
https://www.ncbi.nlm.nih.gov/pubmed/35588440
http://dx.doi.org/10.1371/journal.pone.0268327
work_keys_str_mv AT ferrarielisa acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT garganiluna acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT barbierigreta acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT ghiadonilorenzo acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT faitafrancesco acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT bacciudavide acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT ferrarielisa causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT garganiluna causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT barbierigreta causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT ghiadonilorenzo causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT faitafrancesco causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata
AT bacciudavide causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata