Cargando…

1162. A data-driven approach to predict plasma leakage using explainable machine learning

BACKGROUND: Dengue could cause complications with an estimated 10,000 deaths per annum. It mostly affects low- and middle-income countries such as Sri Lanka with limited healthcare resources to handle seasonal outbreaks. A third of dengue patients usually have a critical phase characterized by plasm...

Descripción completa

Detalles Bibliográficos
Autores principales: Marandi, Ramtin Zargari, Leung, Preston, Sigera, Chathurani, Murray, Daniel Dawson, Weeratunga, Praveen, Fernando, Deepika, Rodrigo, Chaturaka, Rajapakse, Senaka, MacPherson, Cameron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752983/
http://dx.doi.org/10.1093/ofid/ofac492.999
_version_ 1784850862446739456
author Marandi, Ramtin Zargari
Leung, Preston
Sigera, Chathurani
Murray, Daniel Dawson
Weeratunga, Praveen
Fernando, Deepika
Rodrigo, Chaturaka
Rajapakse, Senaka
MacPherson, Cameron
author_facet Marandi, Ramtin Zargari
Leung, Preston
Sigera, Chathurani
Murray, Daniel Dawson
Weeratunga, Praveen
Fernando, Deepika
Rodrigo, Chaturaka
Rajapakse, Senaka
MacPherson, Cameron
author_sort Marandi, Ramtin Zargari
collection PubMed
description BACKGROUND: Dengue could cause complications with an estimated 10,000 deaths per annum. It mostly affects low- and middle-income countries such as Sri Lanka with limited healthcare resources to handle seasonal outbreaks. A third of dengue patients usually have a critical phase characterized by plasma leakage with increased risk of life-threatening complications. A data-driven approach was required to find early predictors of plasma leakage that are usually available in routine care from a resource limited setting, as means of triaging patients for hospital admission. METHODS: We utilized a prospective cohort (The Colombo dengue study in Sri Lanka) that recruits patients meeting the clinical case definition of dengue fever. The cohort includes 4,781 instances of clinical signs, symptoms, and in-hospital laboratory tests from N=877 patients (60.3% patients infected by Dengue) recorded in first four days of fever. By excluding incomplete patient instances, the data was randomly split to a development set (N=378) and a test set (N=144). From the development set, five most informative features were identified using the minimum description length (MDL) algorithm. Logistic regression was used to create a prediction model using the development set. Shapley analysis was used to explain the model on the test set extracting the extent by which each predictor contributed to the predictions. RESULTS: The MDL algorithm revealed that hemoglobin (HGB), hematocrit (HCT), aspartate aminotransferase (AST), age, and gender to be the most informative predictors of plasma leakage. The logistic regression model predicted plasma leakage with (AUC = 0.76) on the test set (Fig 1). The HGB appeared to contribute the most to the predictions with the higher values associated with higher predicted risk of plasma leakage and vice versa (Fig 2). [Figure: see text] Figure 2 SHAP decision plot for the logistic regression model on the test set. The features are sorted from top to bottom by their mean absolute SHAP values (higher interpreted as more contributing). Feature values are normalised to [0 1] by the min-max normalisation method and colour-coded (grey points are missing values), outliers were squished to the range using Hampel filter. For gender, male is indicated by red and female by blue. The colour of each line is the same as the value of the feature connected to in downwards direction. [Figure: see text] CONCLUSION: Our results give support to the predictability of plasma leakage in patients suspected of Dengue fever in their first four days of fever onset. The study also underlines the relevance of the machine learning approach to identify the predictors and the practicality of the prediction model as reflected by the prediction performance to triage patients for hospital admission in resource limited settings. DISCLOSURES: All Authors: No reported disclosures.
format Online
Article
Text
id pubmed-9752983
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97529832022-12-16 1162. A data-driven approach to predict plasma leakage using explainable machine learning Marandi, Ramtin Zargari Leung, Preston Sigera, Chathurani Murray, Daniel Dawson Weeratunga, Praveen Fernando, Deepika Rodrigo, Chaturaka Rajapakse, Senaka MacPherson, Cameron Open Forum Infect Dis Abstracts BACKGROUND: Dengue could cause complications with an estimated 10,000 deaths per annum. It mostly affects low- and middle-income countries such as Sri Lanka with limited healthcare resources to handle seasonal outbreaks. A third of dengue patients usually have a critical phase characterized by plasma leakage with increased risk of life-threatening complications. A data-driven approach was required to find early predictors of plasma leakage that are usually available in routine care from a resource limited setting, as means of triaging patients for hospital admission. METHODS: We utilized a prospective cohort (The Colombo dengue study in Sri Lanka) that recruits patients meeting the clinical case definition of dengue fever. The cohort includes 4,781 instances of clinical signs, symptoms, and in-hospital laboratory tests from N=877 patients (60.3% patients infected by Dengue) recorded in first four days of fever. By excluding incomplete patient instances, the data was randomly split to a development set (N=378) and a test set (N=144). From the development set, five most informative features were identified using the minimum description length (MDL) algorithm. Logistic regression was used to create a prediction model using the development set. Shapley analysis was used to explain the model on the test set extracting the extent by which each predictor contributed to the predictions. RESULTS: The MDL algorithm revealed that hemoglobin (HGB), hematocrit (HCT), aspartate aminotransferase (AST), age, and gender to be the most informative predictors of plasma leakage. The logistic regression model predicted plasma leakage with (AUC = 0.76) on the test set (Fig 1). The HGB appeared to contribute the most to the predictions with the higher values associated with higher predicted risk of plasma leakage and vice versa (Fig 2). [Figure: see text] Figure 2 SHAP decision plot for the logistic regression model on the test set. The features are sorted from top to bottom by their mean absolute SHAP values (higher interpreted as more contributing). Feature values are normalised to [0 1] by the min-max normalisation method and colour-coded (grey points are missing values), outliers were squished to the range using Hampel filter. For gender, male is indicated by red and female by blue. The colour of each line is the same as the value of the feature connected to in downwards direction. [Figure: see text] CONCLUSION: Our results give support to the predictability of plasma leakage in patients suspected of Dengue fever in their first four days of fever onset. The study also underlines the relevance of the machine learning approach to identify the predictors and the practicality of the prediction model as reflected by the prediction performance to triage patients for hospital admission in resource limited settings. DISCLOSURES: All Authors: No reported disclosures. Oxford University Press 2022-12-15 /pmc/articles/PMC9752983/ http://dx.doi.org/10.1093/ofid/ofac492.999 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Infectious Diseases Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Abstracts
Marandi, Ramtin Zargari
Leung, Preston
Sigera, Chathurani
Murray, Daniel Dawson
Weeratunga, Praveen
Fernando, Deepika
Rodrigo, Chaturaka
Rajapakse, Senaka
MacPherson, Cameron
1162. A data-driven approach to predict plasma leakage using explainable machine learning
title 1162. A data-driven approach to predict plasma leakage using explainable machine learning
title_full 1162. A data-driven approach to predict plasma leakage using explainable machine learning
title_fullStr 1162. A data-driven approach to predict plasma leakage using explainable machine learning
title_full_unstemmed 1162. A data-driven approach to predict plasma leakage using explainable machine learning
title_short 1162. A data-driven approach to predict plasma leakage using explainable machine learning
title_sort 1162. a data-driven approach to predict plasma leakage using explainable machine learning
topic Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752983/
http://dx.doi.org/10.1093/ofid/ofac492.999
work_keys_str_mv AT marandiramtinzargari 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT leungpreston 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT sigerachathurani 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT murraydanieldawson 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT weeratungapraveen 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT fernandodeepika 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT rodrigochaturaka 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT rajapaksesenaka 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning
AT macphersoncameron 1162adatadrivenapproachtopredictplasmaleakageusingexplainablemachinelearning