Cargando…

Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times

BACKGROUND: To preserve patient anonymity, health register data may be provided as binned data only. Here we consider as example, how to estimate mean survival time after a diagnosis of metastatic colorectal cancer from Norwegian register data on time to death or censoring binned into 30 day interva...

Descripción completa

Detalles Bibliográficos
Autores principales: Støvring, Henrik, Kristiansen, Ivar S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3748025/
https://www.ncbi.nlm.nih.gov/pubmed/21867515
http://dx.doi.org/10.1186/1756-0500-4-308
_version_ 1782281019059077120
author Støvring, Henrik
Kristiansen, Ivar S
author_facet Støvring, Henrik
Kristiansen, Ivar S
author_sort Støvring, Henrik
collection PubMed
description BACKGROUND: To preserve patient anonymity, health register data may be provided as binned data only. Here we consider as example, how to estimate mean survival time after a diagnosis of metastatic colorectal cancer from Norwegian register data on time to death or censoring binned into 30 day intervals. All events occurring in the first three months (90 days) after diagnosis were removed to achieve comparability with a clinical trial. The aim of the paper is to develop and implement a simple, and yet flexible method for analyzing such interval censored and truncated data. METHODS: Considering interval censoring a missing data problem, we implement a simple multiple imputation strategy that allows flexible sensitivity analyses with respect to the shape of the censoring distribution. To allow identification of appropriate parametric models, a χ(2)-goodness-of-fit test--also imputation based--is derived and supplemented with diagnostic plots. Uncertainty estimates for mean survival times are obtained via a simulation strategy. The validity and statistical efficiency of the proposed method for varying interval lengths is investigated in a simulation study and compared with simpler alternatives. RESULTS: Mean survival times estimated from the register data ranged from 1.2 (SE = 0.09) to 3.2 (0.31) years depending on period of diagnosis and choice of parametric model. The shape of the censoring distribution within intervals did generally not influence results, whereas the choice of parametric model did, even when different models fit the data equally well. In simulation studies both simple midpoint imputation and multiple imputation yielded nearly unbiased analyses (relative biases of -0.6% to 9.4%) and confidence intervals with near-nominal coverage probabilities (93.4% to 95.7%) for censoring intervals shorter than six months. For 12 month censoring intervals, multiple imputation provided better protection against bias, and coverage probabilities closer to nominal values than simple midpoint imputation. CONCLUSION: Binning of event and censoring times should be considered a viable strategy for anonymizing register data on survival times, as they may be readily analyzed with methods based on multiple imputation.
format Online
Article
Text
id pubmed-3748025
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37480252013-08-22 Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times Støvring, Henrik Kristiansen, Ivar S BMC Res Notes Research Article BACKGROUND: To preserve patient anonymity, health register data may be provided as binned data only. Here we consider as example, how to estimate mean survival time after a diagnosis of metastatic colorectal cancer from Norwegian register data on time to death or censoring binned into 30 day intervals. All events occurring in the first three months (90 days) after diagnosis were removed to achieve comparability with a clinical trial. The aim of the paper is to develop and implement a simple, and yet flexible method for analyzing such interval censored and truncated data. METHODS: Considering interval censoring a missing data problem, we implement a simple multiple imputation strategy that allows flexible sensitivity analyses with respect to the shape of the censoring distribution. To allow identification of appropriate parametric models, a χ(2)-goodness-of-fit test--also imputation based--is derived and supplemented with diagnostic plots. Uncertainty estimates for mean survival times are obtained via a simulation strategy. The validity and statistical efficiency of the proposed method for varying interval lengths is investigated in a simulation study and compared with simpler alternatives. RESULTS: Mean survival times estimated from the register data ranged from 1.2 (SE = 0.09) to 3.2 (0.31) years depending on period of diagnosis and choice of parametric model. The shape of the censoring distribution within intervals did generally not influence results, whereas the choice of parametric model did, even when different models fit the data equally well. In simulation studies both simple midpoint imputation and multiple imputation yielded nearly unbiased analyses (relative biases of -0.6% to 9.4%) and confidence intervals with near-nominal coverage probabilities (93.4% to 95.7%) for censoring intervals shorter than six months. For 12 month censoring intervals, multiple imputation provided better protection against bias, and coverage probabilities closer to nominal values than simple midpoint imputation. CONCLUSION: Binning of event and censoring times should be considered a viable strategy for anonymizing register data on survival times, as they may be readily analyzed with methods based on multiple imputation. BioMed Central 2011-08-25 /pmc/articles/PMC3748025/ /pubmed/21867515 http://dx.doi.org/10.1186/1756-0500-4-308 Text en Copyright ©2011 Støvring et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Støvring, Henrik
Kristiansen, Ivar S
Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title_full Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title_fullStr Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title_full_unstemmed Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title_short Simple parametric survival analysis with anonymized register data: A cohort study with truncated and interval censored event and censoring times
title_sort simple parametric survival analysis with anonymized register data: a cohort study with truncated and interval censored event and censoring times
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3748025/
https://www.ncbi.nlm.nih.gov/pubmed/21867515
http://dx.doi.org/10.1186/1756-0500-4-308
work_keys_str_mv AT støvringhenrik simpleparametricsurvivalanalysiswithanonymizedregisterdataacohortstudywithtruncatedandintervalcensoredeventandcensoringtimes
AT kristiansenivars simpleparametricsurvivalanalysiswithanonymizedregisterdataacohortstudywithtruncatedandintervalcensoredeventandcensoringtimes