Cargando…
Missing data approaches for probability regression models with missing outcomes with applications
In this paper, we investigate several well known approaches for missing data and their relationships for the parametric probability regression model P(β)(Y|X) when outcome of interest Y is subject to missingness. We explore the relationships between the mean score method, the inverse probability wei...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4757472/ https://www.ncbi.nlm.nih.gov/pubmed/26900543 http://dx.doi.org/10.1186/s40488-014-0023-3 |
_version_ | 1782416459934203904 |
---|---|
author | Qi, Li Sun, Yanqing |
author_facet | Qi, Li Sun, Yanqing |
author_sort | Qi, Li |
collection | PubMed |
description | In this paper, we investigate several well known approaches for missing data and their relationships for the parametric probability regression model P(β)(Y|X) when outcome of interest Y is subject to missingness. We explore the relationships between the mean score method, the inverse probability weighting (IPW) method and the augmented inverse probability weighted (AIPW) method with some interesting findings. The asymptotic distributions of the IPW and AIPW estimators are derived and their efficiencies are compared. Our analysis details how efficiency may be gained from the AIPW estimator over the IPW estimator through estimation of validation probability and augmentation. We show that the AIPW estimator that is based on augmentation using the full set of observed variables is more efficient than the AIPW estimator that is based on augmentation using a subset of observed variables. The developed approaches are applied to Poisson regression model with missing outcomes based on auxiliary outcomes and a validated sample for true outcomes. We show that, by stratifying based on a set of discrete variables, the proposed statistical procedure can be formulated to analyze automated records that only contain summarized information at categorical levels. The proposed methods are applied to analyze influenza vaccine efficacy for an influenza vaccine study conducted in Temple-Belton, Texas during the 2000-2001 influenza season. |
format | Online Article Text |
id | pubmed-4757472 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
record_format | MEDLINE/PubMed |
spelling | pubmed-47574722016-02-17 Missing data approaches for probability regression models with missing outcomes with applications Qi, Li Sun, Yanqing J Stat Distrib Appl Article In this paper, we investigate several well known approaches for missing data and their relationships for the parametric probability regression model P(β)(Y|X) when outcome of interest Y is subject to missingness. We explore the relationships between the mean score method, the inverse probability weighting (IPW) method and the augmented inverse probability weighted (AIPW) method with some interesting findings. The asymptotic distributions of the IPW and AIPW estimators are derived and their efficiencies are compared. Our analysis details how efficiency may be gained from the AIPW estimator over the IPW estimator through estimation of validation probability and augmentation. We show that the AIPW estimator that is based on augmentation using the full set of observed variables is more efficient than the AIPW estimator that is based on augmentation using a subset of observed variables. The developed approaches are applied to Poisson regression model with missing outcomes based on auxiliary outcomes and a validated sample for true outcomes. We show that, by stratifying based on a set of discrete variables, the proposed statistical procedure can be formulated to analyze automated records that only contain summarized information at categorical levels. The proposed methods are applied to analyze influenza vaccine efficacy for an influenza vaccine study conducted in Temple-Belton, Texas during the 2000-2001 influenza season. 2014-12-16 2014 /pmc/articles/PMC4757472/ /pubmed/26900543 http://dx.doi.org/10.1186/s40488-014-0023-3 Text en http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Article Qi, Li Sun, Yanqing Missing data approaches for probability regression models with missing outcomes with applications |
title | Missing data approaches for probability regression models with
missing outcomes with applications |
title_full | Missing data approaches for probability regression models with
missing outcomes with applications |
title_fullStr | Missing data approaches for probability regression models with
missing outcomes with applications |
title_full_unstemmed | Missing data approaches for probability regression models with
missing outcomes with applications |
title_short | Missing data approaches for probability regression models with
missing outcomes with applications |
title_sort | missing data approaches for probability regression models with
missing outcomes with applications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4757472/ https://www.ncbi.nlm.nih.gov/pubmed/26900543 http://dx.doi.org/10.1186/s40488-014-0023-3 |
work_keys_str_mv | AT qili missingdataapproachesforprobabilityregressionmodelswithmissingoutcomeswithapplications AT sunyanqing missingdataapproachesforprobabilityregressionmodelswithmissingoutcomeswithapplications |