Cargando…
Multiple imputation for an incomplete covariate that is a ratio
We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons Ltd
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3920636/ https://www.ncbi.nlm.nih.gov/pubmed/23922236 http://dx.doi.org/10.1002/sim.5935 |
_version_ | 1782303204296359936 |
---|---|
author | Morris, Tim P White, Ian R Royston, Patrick Seaman, Shaun R Wood, Angela M |
author_facet | Morris, Tim P White, Ian R Royston, Patrick Seaman, Shaun R Wood, Angela M |
author_sort | Morris, Tim P |
collection | PubMed |
description | We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is to impute missing values for the numerator and denominator, or the log-transformed numerator and denominator, and then calculate the ratio of interest; we call this ‘passive’ imputation. Alternatively, missing ratio values might be imputed directly, with or without the numerator and/or the denominator in the imputation model; we call this ‘active’ imputation. In two motivating datasets, one involving body mass index as a covariate and the other involving the ratio of total to high-density lipoprotein cholesterol, we assess the sensitivity of results to the choice of imputation model and, as an alternative, explore fully Bayesian joint models for the outcome and incomplete ratio. Fully Bayesian approaches using Winbugs were unusable in both datasets because of computational problems. In our first dataset, multiple imputation results are similar regardless of the imputation model; in the second, results are sensitive to the choice of imputation model. Sensitivity depends strongly on the coefficient of variation of the ratio's denominator. A simulation study demonstrates that passive imputation without transformation is risky because it can lead to downward bias when the coefficient of variation of the ratio's denominator is larger than about 0.1. Active imputation or passive imputation after log-transformation is preferable. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. |
format | Online Article Text |
id | pubmed-3920636 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | John Wiley & Sons Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-39206362014-02-19 Multiple imputation for an incomplete covariate that is a ratio Morris, Tim P White, Ian R Royston, Patrick Seaman, Shaun R Wood, Angela M Stat Med Research Articles We are concerned with multiple imputation of the ratio of two variables, which is to be used as a covariate in a regression analysis. If the numerator and denominator are not missing simultaneously, it seems sensible to make use of the observed variable in the imputation model. One such strategy is to impute missing values for the numerator and denominator, or the log-transformed numerator and denominator, and then calculate the ratio of interest; we call this ‘passive’ imputation. Alternatively, missing ratio values might be imputed directly, with or without the numerator and/or the denominator in the imputation model; we call this ‘active’ imputation. In two motivating datasets, one involving body mass index as a covariate and the other involving the ratio of total to high-density lipoprotein cholesterol, we assess the sensitivity of results to the choice of imputation model and, as an alternative, explore fully Bayesian joint models for the outcome and incomplete ratio. Fully Bayesian approaches using Winbugs were unusable in both datasets because of computational problems. In our first dataset, multiple imputation results are similar regardless of the imputation model; in the second, results are sensitive to the choice of imputation model. Sensitivity depends strongly on the coefficient of variation of the ratio's denominator. A simulation study demonstrates that passive imputation without transformation is risky because it can lead to downward bias when the coefficient of variation of the ratio's denominator is larger than about 0.1. Active imputation or passive imputation after log-transformation is preferable. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. John Wiley & Sons Ltd 2014-01-15 2013-08-06 /pmc/articles/PMC3920636/ /pubmed/23922236 http://dx.doi.org/10.1002/sim.5935 Text en © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. http://creativecommons.org/licenses/by/3.0/ This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Morris, Tim P White, Ian R Royston, Patrick Seaman, Shaun R Wood, Angela M Multiple imputation for an incomplete covariate that is a ratio |
title | Multiple imputation for an incomplete covariate that is a ratio |
title_full | Multiple imputation for an incomplete covariate that is a ratio |
title_fullStr | Multiple imputation for an incomplete covariate that is a ratio |
title_full_unstemmed | Multiple imputation for an incomplete covariate that is a ratio |
title_short | Multiple imputation for an incomplete covariate that is a ratio |
title_sort | multiple imputation for an incomplete covariate that is a ratio |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3920636/ https://www.ncbi.nlm.nih.gov/pubmed/23922236 http://dx.doi.org/10.1002/sim.5935 |
work_keys_str_mv | AT morristimp multipleimputationforanincompletecovariatethatisaratio AT whiteianr multipleimputationforanincompletecovariatethatisaratio AT roystonpatrick multipleimputationforanincompletecovariatethatisaratio AT seamanshaunr multipleimputationforanincompletecovariatethatisaratio AT woodangelam multipleimputationforanincompletecovariatethatisaratio |