Cargando…
An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis
We address the problem of how COVID-19 deaths observed in an oncology clinical trial can be consistently taken into account in typical survival estimates. We refer to oncological patients since there is empirical evidence of strong correlation between COVID-19 and cancer deaths, which implies that C...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955008/ https://www.ncbi.nlm.nih.gov/pubmed/36826124 http://dx.doi.org/10.3390/curroncol30020163 |
_version_ | 1784894250849140736 |
---|---|
author | De Felice, Francesca Mazzoni, Luca Moriconi, Franco |
author_facet | De Felice, Francesca Mazzoni, Luca Moriconi, Franco |
author_sort | De Felice, Francesca |
collection | PubMed |
description | We address the problem of how COVID-19 deaths observed in an oncology clinical trial can be consistently taken into account in typical survival estimates. We refer to oncological patients since there is empirical evidence of strong correlation between COVID-19 and cancer deaths, which implies that COVID-19 deaths cannot be treated simply as non-informative censoring, a property usually required by the classical survival estimators. We consider the problem in the framework of the widely used Kaplan–Meier (KM) estimator. Through a counterfactual approach, an algorithmic method is developed allowing to include COVID-19 deaths in the observed data by mean-imputation. The procedure can be seen in the class of the Expectation-Maximization (EM) algorithms and will be referred to as Covid-Death Mean-Imputation (CoDMI) algorithm. We discuss the CoDMI underlying assumptions and the convergence issue. The algorithm provides a completed lifetime data set, where each Covid-death time is replaced by a point estimate of the corresponding virtual lifetime. This complete data set is naturally equipped with the corresponding KM survival function estimate and all available statistical tools can be applied to these data. However, mean-imputation requires an increased variance of the estimates. We then propose a natural extension of the classical Greenwood’s formula, thus obtaining expanded confidence intervals for the survival function estimate. To illustrate how the algorithm works, CoDMI is applied to real medical data extended by the addition of artificial Covid-death observations. The results are compared with the estimates provided by the two naïve approaches which count COVID-19 deaths as censoring or as deaths by the disease under study. In order to evaluate the predictive performances of CoDMI an extensive simulation study is carried out. The results indicate that in the simulated scenarios CoDMI is roughly unbiased and outperforms the estimates obtained by the naïve approaches. A user-friendly version of CoDMI programmed in R is freely available. |
format | Online Article Text |
id | pubmed-9955008 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99550082023-02-25 An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis De Felice, Francesca Mazzoni, Luca Moriconi, Franco Curr Oncol Article We address the problem of how COVID-19 deaths observed in an oncology clinical trial can be consistently taken into account in typical survival estimates. We refer to oncological patients since there is empirical evidence of strong correlation between COVID-19 and cancer deaths, which implies that COVID-19 deaths cannot be treated simply as non-informative censoring, a property usually required by the classical survival estimators. We consider the problem in the framework of the widely used Kaplan–Meier (KM) estimator. Through a counterfactual approach, an algorithmic method is developed allowing to include COVID-19 deaths in the observed data by mean-imputation. The procedure can be seen in the class of the Expectation-Maximization (EM) algorithms and will be referred to as Covid-Death Mean-Imputation (CoDMI) algorithm. We discuss the CoDMI underlying assumptions and the convergence issue. The algorithm provides a completed lifetime data set, where each Covid-death time is replaced by a point estimate of the corresponding virtual lifetime. This complete data set is naturally equipped with the corresponding KM survival function estimate and all available statistical tools can be applied to these data. However, mean-imputation requires an increased variance of the estimates. We then propose a natural extension of the classical Greenwood’s formula, thus obtaining expanded confidence intervals for the survival function estimate. To illustrate how the algorithm works, CoDMI is applied to real medical data extended by the addition of artificial Covid-death observations. The results are compared with the estimates provided by the two naïve approaches which count COVID-19 deaths as censoring or as deaths by the disease under study. In order to evaluate the predictive performances of CoDMI an extensive simulation study is carried out. The results indicate that in the simulated scenarios CoDMI is roughly unbiased and outperforms the estimates obtained by the naïve approaches. A user-friendly version of CoDMI programmed in R is freely available. MDPI 2023-02-08 /pmc/articles/PMC9955008/ /pubmed/36826124 http://dx.doi.org/10.3390/curroncol30020163 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article De Felice, Francesca Mazzoni, Luca Moriconi, Franco An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title | An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title_full | An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title_fullStr | An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title_full_unstemmed | An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title_short | An Expectation-Maximization Algorithm for Including Oncological COVID-19 Deaths in Survival Analysis |
title_sort | expectation-maximization algorithm for including oncological covid-19 deaths in survival analysis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9955008/ https://www.ncbi.nlm.nih.gov/pubmed/36826124 http://dx.doi.org/10.3390/curroncol30020163 |
work_keys_str_mv | AT defelicefrancesca anexpectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis AT mazzoniluca anexpectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis AT moriconifranco anexpectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis AT defelicefrancesca expectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis AT mazzoniluca expectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis AT moriconifranco expectationmaximizationalgorithmforincludingoncologicalcovid19deathsinsurvivalanalysis |