Cargando…

Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data

INTRODUCTION: US claims data contain medical data on large heterogeneous populations and are excellent sources for medical research. Some claims data do not contain complete death records, limiting their use for mortality or mortality-related studies. A model to predict whether a patient died at the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Reps, Jenna M., Rijnbeek, Peter R., Ryan, Patrick B.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2019
Materias:	Original Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6834730/ https://www.ncbi.nlm.nih.gov/pubmed/31054141 http://dx.doi.org/10.1007/s40264-019-00827-0

_version_	1783466538686218240
author	Reps, Jenna M. Rijnbeek, Peter R. Ryan, Patrick B.
author_facet	Reps, Jenna M. Rijnbeek, Peter R. Ryan, Patrick B.
author_sort	Reps, Jenna M.
collection	PubMed
description	INTRODUCTION: US claims data contain medical data on large heterogeneous populations and are excellent sources for medical research. Some claims data do not contain complete death records, limiting their use for mortality or mortality-related studies. A model to predict whether a patient died at the end of the follow-up time (referred to as the end of observation) is needed to enable mortality-related studies. OBJECTIVE: The objective of this study was to develop a patient-level model to predict whether the end of observation was due to death in US claims data. METHODS: We used a claims dataset with full death records, Optum(©) De-Identified Clinformatics(®) Data-Mart-Database—Date of Death mapped to the Observational Medical Outcome Partnership common data model, to develop a model that classifies the end of observations into death or non-death. A regularized logistic regression was trained using 88,514 predictors (recorded within the prior 365 or 30 days) and externally validated by applying the model to three US claims datasets. RESULTS: Approximately 25 in 1000 end of observations in Optum are due to death. The Discriminating End of observation into Alive and Dead (DEAD) model obtained an area under the receiver operating characteristic curve of 0.986. When defining death as a predicted risk of > 0.5, only 2% of the end of observations were predicted to be due to death and the model obtained a sensitivity of 62% and a positive predictive value of 74.8%. The external validation showed the model was transportable, with area under the receiver operating characteristic curves ranging between 0.951 and 0.995 across the US claims databases. CONCLUSIONS: US claims data often lack complete death records. The DEAD model can be used to impute death at various sensitivity, specificity, or positive predictive values depending on the use of the model. The DEAD model can be readily applied to any observational healthcare database mapped to the Observational Medical Outcome Partnership common data model and is available from https://github.com/OHDSI/StudyProtocolSandbox/tree/master/DeadModel. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s40264-019-00827-0) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-6834730
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-68347302019-11-20 Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data Reps, Jenna M. Rijnbeek, Peter R. Ryan, Patrick B. Drug Saf Original Research Article INTRODUCTION: US claims data contain medical data on large heterogeneous populations and are excellent sources for medical research. Some claims data do not contain complete death records, limiting their use for mortality or mortality-related studies. A model to predict whether a patient died at the end of the follow-up time (referred to as the end of observation) is needed to enable mortality-related studies. OBJECTIVE: The objective of this study was to develop a patient-level model to predict whether the end of observation was due to death in US claims data. METHODS: We used a claims dataset with full death records, Optum(©) De-Identified Clinformatics(®) Data-Mart-Database—Date of Death mapped to the Observational Medical Outcome Partnership common data model, to develop a model that classifies the end of observations into death or non-death. A regularized logistic regression was trained using 88,514 predictors (recorded within the prior 365 or 30 days) and externally validated by applying the model to three US claims datasets. RESULTS: Approximately 25 in 1000 end of observations in Optum are due to death. The Discriminating End of observation into Alive and Dead (DEAD) model obtained an area under the receiver operating characteristic curve of 0.986. When defining death as a predicted risk of > 0.5, only 2% of the end of observations were predicted to be due to death and the model obtained a sensitivity of 62% and a positive predictive value of 74.8%. The external validation showed the model was transportable, with area under the receiver operating characteristic curves ranging between 0.951 and 0.995 across the US claims databases. CONCLUSIONS: US claims data often lack complete death records. The DEAD model can be used to impute death at various sensitivity, specificity, or positive predictive values depending on the use of the model. The DEAD model can be readily applied to any observational healthcare database mapped to the Observational Medical Outcome Partnership common data model and is available from https://github.com/OHDSI/StudyProtocolSandbox/tree/master/DeadModel. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1007/s40264-019-00827-0) contains supplementary material, which is available to authorized users. Springer International Publishing 2019-05-03 2019 /pmc/articles/PMC6834730/ /pubmed/31054141 http://dx.doi.org/10.1007/s40264-019-00827-0 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/), which permits any noncommercial use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle	Original Research Article Reps, Jenna M. Rijnbeek, Peter R. Ryan, Patrick B. Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title	Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title_full	Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title_fullStr	Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title_full_unstemmed	Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title_short	Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data
title_sort	identifying the dead: development and validation of a patient-level model to predict death status in population-level claims data
topic	Original Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6834730/ https://www.ncbi.nlm.nih.gov/pubmed/31054141 http://dx.doi.org/10.1007/s40264-019-00827-0
work_keys_str_mv	AT repsjennam identifyingthedeaddevelopmentandvalidationofapatientlevelmodeltopredictdeathstatusinpopulationlevelclaimsdata AT rijnbeekpeterr identifyingthedeaddevelopmentandvalidationofapatientlevelmodeltopredictdeathstatusinpopulationlevelclaimsdata AT ryanpatrickb identifyingthedeaddevelopmentandvalidationofapatientlevelmodeltopredictdeathstatusinpopulationlevelclaimsdata

Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data

Ejemplares similares