Cargando…
A causal learning framework for the analysis and interpretation of COVID-19 clinical data
We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9119448/ https://www.ncbi.nlm.nih.gov/pubmed/35588440 http://dx.doi.org/10.1371/journal.pone.0268327 |
_version_ | 1784710702065254400 |
---|---|
author | Ferrari, Elisa Gargani, Luna Barbieri, Greta Ghiadoni, Lorenzo Faita, Francesco Bacciu, Davide |
author_facet | Ferrari, Elisa Gargani, Luna Barbieri, Greta Ghiadoni, Lorenzo Faita, Francesco Bacciu, Davide |
author_sort | Ferrari, Elisa |
collection | PubMed |
description | We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph showing the causal connections among the analyzed features. The workflow consists in a multi-step approach that goes from identifying the main causes of patient’s outcome through BSL, to the realization of a tool suitable for clinical practice, based on a Binary Decision Tree (BDT), to recognize patients at high-risk with information available already at hospital admission time. We evaluate our approach on a feature-rich dataset of Coronavirus disease (COVID-19), showing that the proposed framework provides a schematic overview of the multi-factorial processes that jointly contribute to the outcome. We compare our findings with current literature on COVID-19, showing that this approach allows to re-discover established cause-effect relationships about the disease. Further, our approach yields to a highly interpretable tool correctly predicting the outcome of 85% of subjects based exclusively on 3 features: age, a previous history of chronic obstructive pulmonary disease and the PaO2/FiO2 ratio at the time of arrival to the hospital. The inclusion of additional information from 4 routine blood tests (Creatinine, Glucose, pO2 and Sodium) increases predictive accuracy to 94.5%. |
format | Online Article Text |
id | pubmed-9119448 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-91194482022-05-20 A causal learning framework for the analysis and interpretation of COVID-19 clinical data Ferrari, Elisa Gargani, Luna Barbieri, Greta Ghiadoni, Lorenzo Faita, Francesco Bacciu, Davide PLoS One Research Article We present a workflow for clinical data analysis that relies on Bayesian Structure Learning (BSL), an unsupervised learning approach, robust to noise and biases, that allows to incorporate prior medical knowledge into the learning process and that provides explainable results in the form of a graph showing the causal connections among the analyzed features. The workflow consists in a multi-step approach that goes from identifying the main causes of patient’s outcome through BSL, to the realization of a tool suitable for clinical practice, based on a Binary Decision Tree (BDT), to recognize patients at high-risk with information available already at hospital admission time. We evaluate our approach on a feature-rich dataset of Coronavirus disease (COVID-19), showing that the proposed framework provides a schematic overview of the multi-factorial processes that jointly contribute to the outcome. We compare our findings with current literature on COVID-19, showing that this approach allows to re-discover established cause-effect relationships about the disease. Further, our approach yields to a highly interpretable tool correctly predicting the outcome of 85% of subjects based exclusively on 3 features: age, a previous history of chronic obstructive pulmonary disease and the PaO2/FiO2 ratio at the time of arrival to the hospital. The inclusion of additional information from 4 routine blood tests (Creatinine, Glucose, pO2 and Sodium) increases predictive accuracy to 94.5%. Public Library of Science 2022-05-19 /pmc/articles/PMC9119448/ /pubmed/35588440 http://dx.doi.org/10.1371/journal.pone.0268327 Text en © 2022 Ferrari et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ferrari, Elisa Gargani, Luna Barbieri, Greta Ghiadoni, Lorenzo Faita, Francesco Bacciu, Davide A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title | A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title_full | A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title_fullStr | A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title_full_unstemmed | A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title_short | A causal learning framework for the analysis and interpretation of COVID-19 clinical data |
title_sort | causal learning framework for the analysis and interpretation of covid-19 clinical data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9119448/ https://www.ncbi.nlm.nih.gov/pubmed/35588440 http://dx.doi.org/10.1371/journal.pone.0268327 |
work_keys_str_mv | AT ferrarielisa acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT garganiluna acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT barbierigreta acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT ghiadonilorenzo acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT faitafrancesco acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT bacciudavide acausallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT ferrarielisa causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT garganiluna causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT barbierigreta causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT ghiadonilorenzo causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT faitafrancesco causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata AT bacciudavide causallearningframeworkfortheanalysisandinterpretationofcovid19clinicaldata |