Cargando…
The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study
BACKGROUND: Missing data are often an issue in electronic medical records (EMRs) research. However, there are many ways that people deal with missing data in drug safety studies. AIM: To compare the risk estimates resulting from different strategies for the handling of missing data in the study of v...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Dove Medical Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5993167/ https://www.ncbi.nlm.nih.gov/pubmed/29892204 http://dx.doi.org/10.2147/CLEP.S154914 |
_version_ | 1783330191805775872 |
---|---|
author | Martín-Merino, Elisa Calderón-Larrañaga, Amaia Hawley, Samuel Poblador-Plou, Beatriz Llorente-García, Ana Petersen, Irene Prieto-Alhambra, Daniel |
author_facet | Martín-Merino, Elisa Calderón-Larrañaga, Amaia Hawley, Samuel Poblador-Plou, Beatriz Llorente-García, Ana Petersen, Irene Prieto-Alhambra, Daniel |
author_sort | Martín-Merino, Elisa |
collection | PubMed |
description | BACKGROUND: Missing data are often an issue in electronic medical records (EMRs) research. However, there are many ways that people deal with missing data in drug safety studies. AIM: To compare the risk estimates resulting from different strategies for the handling of missing data in the study of venous thromboembolism (VTE) risk associated with antiosteoporotic medications (AOM). METHODS: New users of AOM (alendronic acid, other bisphosphonates, strontium ranelate, selective estrogen receptor modulators, teriparatide, or denosumab) aged ≥50 years during 1998–2014 were identified in two Spanish (the Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria [BIFAP] and EpiChron cohort) and one UK (Clinical Practice Research Datalink [CPRD]) EMR. Hazard ratios (HRs) according to AOM (with alendronic acid as reference) were calculated adjusting for VTE risk factors, body mass index (that was missing in 61% of patients included in the three databases), and smoking (that was missing in 23% of patients) in the year of AOM therapy initiation. HRs and standard errors obtained using cross-sectional multiple imputation (MI) (reference method) were compared to complete case (CC) analysis – using only patients with complete data – and longitudinal MI – adding to the cross-sectional MI model the body mass index/smoking values as recorded in the year before and after therapy initiation. RESULTS: Overall, 422/95,057 (0.4%), 19/12,688 (0.1%), and 2,051/161,202 (1.3%) VTE cases/participants were seen in BIFAP, EpiChron, and CPRD, respectively. HRs moved from 100.00% underestimation to 40.31% overestimation in CC compared with cross-sectional MI, while longitudinal MI methods provided similar risk estimates compared with cross-sectional MI. Precision for HR improved in cross-sectional MI versus CC by up to 160.28%, while longitudinal MI improved precision (compared with cross-sectional) only minimally (up to 0.80%). CONCLUSION: CC may substantially affect relative risk estimation in EMR-based drug safety studies, since missing data are not often completely at random. Little improvement was seen in these data in terms of power with the inclusion of longitudinal MI compared with cross-sectional MI. The strategy for handling missing data in drug safety studies can have a large impact on both risk estimates and precision. |
format | Online Article Text |
id | pubmed-5993167 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Dove Medical Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59931672018-06-11 The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study Martín-Merino, Elisa Calderón-Larrañaga, Amaia Hawley, Samuel Poblador-Plou, Beatriz Llorente-García, Ana Petersen, Irene Prieto-Alhambra, Daniel Clin Epidemiol Methodology BACKGROUND: Missing data are often an issue in electronic medical records (EMRs) research. However, there are many ways that people deal with missing data in drug safety studies. AIM: To compare the risk estimates resulting from different strategies for the handling of missing data in the study of venous thromboembolism (VTE) risk associated with antiosteoporotic medications (AOM). METHODS: New users of AOM (alendronic acid, other bisphosphonates, strontium ranelate, selective estrogen receptor modulators, teriparatide, or denosumab) aged ≥50 years during 1998–2014 were identified in two Spanish (the Base de datos para la Investigación Farmacoepidemiológica en Atención Primaria [BIFAP] and EpiChron cohort) and one UK (Clinical Practice Research Datalink [CPRD]) EMR. Hazard ratios (HRs) according to AOM (with alendronic acid as reference) were calculated adjusting for VTE risk factors, body mass index (that was missing in 61% of patients included in the three databases), and smoking (that was missing in 23% of patients) in the year of AOM therapy initiation. HRs and standard errors obtained using cross-sectional multiple imputation (MI) (reference method) were compared to complete case (CC) analysis – using only patients with complete data – and longitudinal MI – adding to the cross-sectional MI model the body mass index/smoking values as recorded in the year before and after therapy initiation. RESULTS: Overall, 422/95,057 (0.4%), 19/12,688 (0.1%), and 2,051/161,202 (1.3%) VTE cases/participants were seen in BIFAP, EpiChron, and CPRD, respectively. HRs moved from 100.00% underestimation to 40.31% overestimation in CC compared with cross-sectional MI, while longitudinal MI methods provided similar risk estimates compared with cross-sectional MI. Precision for HR improved in cross-sectional MI versus CC by up to 160.28%, while longitudinal MI improved precision (compared with cross-sectional) only minimally (up to 0.80%). CONCLUSION: CC may substantially affect relative risk estimation in EMR-based drug safety studies, since missing data are not often completely at random. Little improvement was seen in these data in terms of power with the inclusion of longitudinal MI compared with cross-sectional MI. The strategy for handling missing data in drug safety studies can have a large impact on both risk estimates and precision. Dove Medical Press 2018-06-05 /pmc/articles/PMC5993167/ /pubmed/29892204 http://dx.doi.org/10.2147/CLEP.S154914 Text en © 2018 Martín-Merino et al. This work is published and licensed by Dove Medical Press Limited The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. |
spellingShingle | Methodology Martín-Merino, Elisa Calderón-Larrañaga, Amaia Hawley, Samuel Poblador-Plou, Beatriz Llorente-García, Ana Petersen, Irene Prieto-Alhambra, Daniel The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title | The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title_full | The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title_fullStr | The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title_full_unstemmed | The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title_short | The impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
title_sort | impact of different strategies to handle missing data on both precision and bias in a drug safety study: a multidatabase multinational population-based cohort study |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5993167/ https://www.ncbi.nlm.nih.gov/pubmed/29892204 http://dx.doi.org/10.2147/CLEP.S154914 |
work_keys_str_mv | AT martinmerinoelisa theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT calderonlarranagaamaia theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT hawleysamuel theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT pobladorploubeatriz theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT llorentegarciaana theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT petersenirene theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT prietoalhambradaniel theimpactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT martinmerinoelisa impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT calderonlarranagaamaia impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT hawleysamuel impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT pobladorploubeatriz impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT llorentegarciaana impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT petersenirene impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy AT prietoalhambradaniel impactofdifferentstrategiestohandlemissingdataonbothprecisionandbiasinadrugsafetystudyamultidatabasemultinationalpopulationbasedcohortstudy |