Cargando…

Multiple imputation for an incomplete covariate that is a ratio

We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is...

Descripción completa

Detalles Bibliográficos
Autores principales: Morris, Tim P, White, Ian R, Royston, Patrick, Seaman, Shaun R, Wood, Angela M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3920636/
https://www.ncbi.nlm.nih.gov/pubmed/23922236
http://dx.doi.org/10.1002/sim.5935
_version_ 1782303204296359936
author Morris, Tim P
White, Ian R
Royston, Patrick
Seaman, Shaun R
Wood, Angela M
author_facet Morris, Tim P
White, Ian R
Royston, Patrick
Seaman, Shaun R
Wood, Angela M
author_sort Morris, Tim P
collection PubMed
description We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is to impute missing values for the numerator and denominator, or the log-transformed numerator and denominator, and then calculate the ratio of interest; we call this ‘passive’ imputation. Alternatively, missing ratio values might be imputed directly, with or without the numerator and/or the denominator in the imputation model; we call this ‘active’ imputation. In two motivating datasets, one involving body mass index as a covariate and the other involving the ratio of total to high-density lipoprotein cholesterol, we assess the sensitivity of results to the choice of imputation model and, as an alternative, explore fully Bayesian joint models for the outcome and incomplete ratio. Fully Bayesian approaches using Winbugs were unusable in both datasets because of computational problems. In our first dataset, multiple imputation results are similar regardless of the imputation model; in the second, results are sensitive to the choice of imputation model. Sensitivity depends strongly on the coefficient of variation of the ratio's denominator. A simulation study demonstrates that passive imputation without transformation is risky because it can lead to downward bias when the coefficient of variation of the ratio's denominator is larger than about 0.1. Active imputation or passive imputation after log-transformation is preferable. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd.
format Online
Article
Text
id pubmed-3920636
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher John Wiley & Sons Ltd
record_format MEDLINE/PubMed
spelling pubmed-39206362014-02-19 Multiple imputation for an incomplete covariate that is a ratio Morris, Tim P White, Ian R Royston, Patrick Seaman, Shaun R Wood, Angela M Stat Med Research Articles We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is to impute missing values for the numerator and denominator, or the log-transformed numerator and denominator, and then calculate the ratio of interest; we call this ‘passive’ imputation. Alternatively, missing ratio values might be imputed directly, with or without the numerator and/or the denominator in the imputation model; we call this ‘active’ imputation. In two motivating datasets, one involving body mass index as a covariate and the other involving the ratio of total to high-density lipoprotein cholesterol, we assess the sensitivity of results to the choice of imputation model and, as an alternative, explore fully Bayesian joint models for the outcome and incomplete ratio. Fully Bayesian approaches using Winbugs were unusable in both datasets because of computational problems. In our first dataset, multiple imputation results are similar regardless of the imputation model; in the second, results are sensitive to the choice of imputation model. Sensitivity depends strongly on the coefficient of variation of the ratio's denominator. A simulation study demonstrates that passive imputation without transformation is risky because it can lead to downward bias when the coefficient of variation of the ratio's denominator is larger than about 0.1. Active imputation or passive imputation after log-transformation is preferable. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. John Wiley & Sons Ltd 2014-01-15 2013-08-06 /pmc/articles/PMC3920636/ /pubmed/23922236 http://dx.doi.org/10.1002/sim.5935 Text en © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. http://creativecommons.org/licenses/by/3.0/ This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Morris, Tim P
White, Ian R
Royston, Patrick
Seaman, Shaun R
Wood, Angela M
Multiple imputation for an incomplete covariate that is a ratio
title Multiple imputation for an incomplete covariate that is a ratio
title_full Multiple imputation for an incomplete covariate that is a ratio
title_fullStr Multiple imputation for an incomplete covariate that is a ratio
title_full_unstemmed Multiple imputation for an incomplete covariate that is a ratio
title_short Multiple imputation for an incomplete covariate that is a ratio
title_sort multiple imputation for an incomplete covariate that is a ratio
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3920636/
https://www.ncbi.nlm.nih.gov/pubmed/23922236
http://dx.doi.org/10.1002/sim.5935
work_keys_str_mv AT morristimp multipleimputationforanincompletecovariatethatisaratio
AT whiteianr multipleimputationforanincompletecovariatethatisaratio
AT roystonpatrick multipleimputationforanincompletecovariatethatisaratio
AT seamanshaunr multipleimputationforanincompletecovariatethatisaratio
AT woodangelam multipleimputationforanincompletecovariatethatisaratio