Cargando…

Joint modelling rationale for chained equations

BACKGROUND: Chained equations imputation is widely used in medical research. It uses a set of conditional models, so is more flexible than joint modelling imputation for the imputation of different types of variables (e.g. binary, ordinal or unordered categorical). However, chained equations imputat...

Descripción completa

Detalles Bibliográficos
Autores principales: Hughes, Rachael A, White, Ian R, Seaman, Shaun R, Carpenter, James R, Tilling, Kate, Sterne, Jonathan AC
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3936896/
https://www.ncbi.nlm.nih.gov/pubmed/24559129
http://dx.doi.org/10.1186/1471-2288-14-28
_version_ 1782305385340731392
author Hughes, Rachael A
White, Ian R
Seaman, Shaun R
Carpenter, James R
Tilling, Kate
Sterne, Jonathan AC
author_facet Hughes, Rachael A
White, Ian R
Seaman, Shaun R
Carpenter, James R
Tilling, Kate
Sterne, Jonathan AC
author_sort Hughes, Rachael A
collection PubMed
description BACKGROUND: Chained equations imputation is widely used in medical research. It uses a set of conditional models, so is more flexible than joint modelling imputation for the imputation of different types of variables (e.g. binary, ordinal or unordered categorical). However, chained equations imputation does not correspond to drawing from a joint distribution when the conditional models are incompatible. Concurrently with our work, other authors have shown the equivalence of the two imputation methods in finite samples. METHODS: Taking a different approach, we prove, in finite samples, sufficient conditions for chained equations and joint modelling to yield imputations from the same predictive distribution. Further, we apply this proof in four specific cases and conduct a simulation study which explores the consequences when the conditional models are compatible but the conditions otherwise are not satisfied. RESULTS: We provide an additional “non-informative margins” condition which, together with compatibility, is sufficient. We show that the non-informative margins condition is not satisfied, despite compatible conditional models, in a situation as simple as two continuous variables and one binary variable. Our simulation study demonstrates that as a consequence of this violation order effects can occur; that is, systematic differences depending upon the ordering of the variables in the chained equations algorithm. However, the order effects appear to be small, especially when associations between variables are weak. CONCLUSIONS: Since chained equations is typically used in medical research for datasets with different types of variables, researchers must be aware that order effects are likely to be ubiquitous, but our results suggest they may be small enough to be negligible.
format Online
Article
Text
id pubmed-3936896
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39368962014-03-06 Joint modelling rationale for chained equations Hughes, Rachael A White, Ian R Seaman, Shaun R Carpenter, James R Tilling, Kate Sterne, Jonathan AC BMC Med Res Methodol Research Article BACKGROUND: Chained equations imputation is widely used in medical research. It uses a set of conditional models, so is more flexible than joint modelling imputation for the imputation of different types of variables (e.g. binary, ordinal or unordered categorical). However, chained equations imputation does not correspond to drawing from a joint distribution when the conditional models are incompatible. Concurrently with our work, other authors have shown the equivalence of the two imputation methods in finite samples. METHODS: Taking a different approach, we prove, in finite samples, sufficient conditions for chained equations and joint modelling to yield imputations from the same predictive distribution. Further, we apply this proof in four specific cases and conduct a simulation study which explores the consequences when the conditional models are compatible but the conditions otherwise are not satisfied. RESULTS: We provide an additional “non-informative margins” condition which, together with compatibility, is sufficient. We show that the non-informative margins condition is not satisfied, despite compatible conditional models, in a situation as simple as two continuous variables and one binary variable. Our simulation study demonstrates that as a consequence of this violation order effects can occur; that is, systematic differences depending upon the ordering of the variables in the chained equations algorithm. However, the order effects appear to be small, especially when associations between variables are weak. CONCLUSIONS: Since chained equations is typically used in medical research for datasets with different types of variables, researchers must be aware that order effects are likely to be ubiquitous, but our results suggest they may be small enough to be negligible. BioMed Central 2014-02-21 /pmc/articles/PMC3936896/ /pubmed/24559129 http://dx.doi.org/10.1186/1471-2288-14-28 Text en Copyright © 2014 Hughes et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hughes, Rachael A
White, Ian R
Seaman, Shaun R
Carpenter, James R
Tilling, Kate
Sterne, Jonathan AC
Joint modelling rationale for chained equations
title Joint modelling rationale for chained equations
title_full Joint modelling rationale for chained equations
title_fullStr Joint modelling rationale for chained equations
title_full_unstemmed Joint modelling rationale for chained equations
title_short Joint modelling rationale for chained equations
title_sort joint modelling rationale for chained equations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3936896/
https://www.ncbi.nlm.nih.gov/pubmed/24559129
http://dx.doi.org/10.1186/1471-2288-14-28
work_keys_str_mv AT hughesrachaela jointmodellingrationaleforchainedequations
AT whiteianr jointmodellingrationaleforchainedequations
AT seamanshaunr jointmodellingrationaleforchainedequations
AT carpenterjamesr jointmodellingrationaleforchainedequations
AT tillingkate jointmodellingrationaleforchainedequations
AT sternejonathanac jointmodellingrationaleforchainedequations