Cargando…

Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors

BACKGROUND: Multiple imputation by chained equations (MICE) requires specifying a suitable conditional imputation model for each incomplete variable and then iteratively imputes the missing values. In the presence of missing not at random (MNAR) outcomes, valid statistical inference often requires j...

Descripción completa

Detalles Bibliográficos
Autores principales: Galimard, Jacques-Emmanuel, Chevret, Sylvie, Curis, Emmanuel, Resche-Rigon, Matthieu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6119269/
https://www.ncbi.nlm.nih.gov/pubmed/30170561
http://dx.doi.org/10.1186/s12874-018-0547-1
_version_ 1783352055336796160
author Galimard, Jacques-Emmanuel
Chevret, Sylvie
Curis, Emmanuel
Resche-Rigon, Matthieu
author_facet Galimard, Jacques-Emmanuel
Chevret, Sylvie
Curis, Emmanuel
Resche-Rigon, Matthieu
author_sort Galimard, Jacques-Emmanuel
collection PubMed
description BACKGROUND: Multiple imputation by chained equations (MICE) requires specifying a suitable conditional imputation model for each incomplete variable and then iteratively imputes the missing values. In the presence of missing not at random (MNAR) outcomes, valid statistical inference often requires joint models for missing observations and their indicators of missingness. In this study, we derived an imputation model for missing binary data with MNAR mechanism from Heckman’s model using a one-step maximum likelihood estimator. We applied this approach to improve a previously developed approach for MNAR continuous outcomes using Heckman’s model and a two-step estimator. These models allow us to use a MICE process and can thus also handle missing at random (MAR) predictors in the same MICE process. METHODS: We simulated 1000 datasets of 500 cases. We generated the following missing data mechanisms on 30% of the outcomes: MAR mechanism, weak MNAR mechanism, and strong MNAR mechanism. We then resimulated the first three cases and added an additional 30% of MAR data on a predictor, resulting in 50% of complete cases. We evaluated and compared the performance of the developed approach to that of a complete case approach and classical Heckman’s model estimates. RESULTS: With MNAR outcomes, only methods using Heckman’s model were unbiased, and with a MAR predictor, the developed imputation approach outperformed all the other approaches. CONCLUSIONS: In the presence of MAR predictors, we proposed a simple approach to address MNAR binary or continuous outcomes under a Heckman assumption in a MICE procedure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-018-0547-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6119269
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61192692018-09-05 Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors Galimard, Jacques-Emmanuel Chevret, Sylvie Curis, Emmanuel Resche-Rigon, Matthieu BMC Med Res Methodol Research Article BACKGROUND: Multiple imputation by chained equations (MICE) requires specifying a suitable conditional imputation model for each incomplete variable and then iteratively imputes the missing values. In the presence of missing not at random (MNAR) outcomes, valid statistical inference often requires joint models for missing observations and their indicators of missingness. In this study, we derived an imputation model for missing binary data with MNAR mechanism from Heckman’s model using a one-step maximum likelihood estimator. We applied this approach to improve a previously developed approach for MNAR continuous outcomes using Heckman’s model and a two-step estimator. These models allow us to use a MICE process and can thus also handle missing at random (MAR) predictors in the same MICE process. METHODS: We simulated 1000 datasets of 500 cases. We generated the following missing data mechanisms on 30% of the outcomes: MAR mechanism, weak MNAR mechanism, and strong MNAR mechanism. We then resimulated the first three cases and added an additional 30% of MAR data on a predictor, resulting in 50% of complete cases. We evaluated and compared the performance of the developed approach to that of a complete case approach and classical Heckman’s model estimates. RESULTS: With MNAR outcomes, only methods using Heckman’s model were unbiased, and with a MAR predictor, the developed imputation approach outperformed all the other approaches. CONCLUSIONS: In the presence of MAR predictors, we proposed a simple approach to address MNAR binary or continuous outcomes under a Heckman assumption in a MICE procedure. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12874-018-0547-1) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-31 /pmc/articles/PMC6119269/ /pubmed/30170561 http://dx.doi.org/10.1186/s12874-018-0547-1 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Galimard, Jacques-Emmanuel
Chevret, Sylvie
Curis, Emmanuel
Resche-Rigon, Matthieu
Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title_full Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title_fullStr Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title_full_unstemmed Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title_short Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors
title_sort heckman imputation models for binary or continuous mnar outcomes and mar predictors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6119269/
https://www.ncbi.nlm.nih.gov/pubmed/30170561
http://dx.doi.org/10.1186/s12874-018-0547-1
work_keys_str_mv AT galimardjacquesemmanuel heckmanimputationmodelsforbinaryorcontinuousmnaroutcomesandmarpredictors
AT chevretsylvie heckmanimputationmodelsforbinaryorcontinuousmnaroutcomesandmarpredictors
AT curisemmanuel heckmanimputationmodelsforbinaryorcontinuousmnaroutcomesandmarpredictors
AT rescherigonmatthieu heckmanimputationmodelsforbinaryorcontinuousmnaroutcomesandmarpredictors