Cargando…

Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research

Nonprobability samples have been used frequently in practice including public health study, economics, education, and political polls. Naïve estimates based on nonprobability samples without any further adjustments may suffer from serious selection bias. Mass imputation has been shown to be effectiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Sixia, Woodruff, Alexandra May, Campbell, Janis, Vesely, Sara, Xu, Zheng, Snider, Cuyler
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10609654/
https://www.ncbi.nlm.nih.gov/pubmed/37901444
http://dx.doi.org/10.3390/stats6020039
_version_ 1785128064175308800
author Chen, Sixia
Woodruff, Alexandra May
Campbell, Janis
Vesely, Sara
Xu, Zheng
Snider, Cuyler
author_facet Chen, Sixia
Woodruff, Alexandra May
Campbell, Janis
Vesely, Sara
Xu, Zheng
Snider, Cuyler
author_sort Chen, Sixia
collection PubMed
description Nonprobability samples have been used frequently in practice including public health study, economics, education, and political polls. Naïve estimates based on nonprobability samples without any further adjustments may suffer from serious selection bias. Mass imputation has been shown to be effective in practice to improve the representativeness of nonprobability samples. It builds an imputation model based on nonprobability samples and generates imputed values for all units in the probability samples. In this paper, we compare two mass imputation approaches including latent joint multivariate normal model mass imputation (e.g., Generalized Efficient Regression-Based Imputation with Latent Processes (GERBIL)) and fully conditional specification (FCS) procedures for integrating multiple outcome variables simultaneously. The Monte Carlo simulation study shows the benefits of GERBIL and FCS with predictive mean matching in terms of balancing the Monte Carlo bias and variance. We further evaluate our proposed method by combining the information from Tribal Behavioral Risk Factor Surveillance System and Behavioral Risk Factor Surveillance System data files.
format Online
Article
Text
id pubmed-10609654
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-106096542023-10-27 Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research Chen, Sixia Woodruff, Alexandra May Campbell, Janis Vesely, Sara Xu, Zheng Snider, Cuyler Stats (Basel) Article Nonprobability samples have been used frequently in practice including public health study, economics, education, and political polls. Naïve estimates based on nonprobability samples without any further adjustments may suffer from serious selection bias. Mass imputation has been shown to be effective in practice to improve the representativeness of nonprobability samples. It builds an imputation model based on nonprobability samples and generates imputed values for all units in the probability samples. In this paper, we compare two mass imputation approaches including latent joint multivariate normal model mass imputation (e.g., Generalized Efficient Regression-Based Imputation with Latent Processes (GERBIL)) and fully conditional specification (FCS) procedures for integrating multiple outcome variables simultaneously. The Monte Carlo simulation study shows the benefits of GERBIL and FCS with predictive mean matching in terms of balancing the Monte Carlo bias and variance. We further evaluate our proposed method by combining the information from Tribal Behavioral Risk Factor Surveillance System and Behavioral Risk Factor Surveillance System data files. 2023-06 2023-05-08 /pmc/articles/PMC10609654/ /pubmed/37901444 http://dx.doi.org/10.3390/stats6020039 Text en https://creativecommons.org/licenses/by/4.0/This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/)
spellingShingle Article
Chen, Sixia
Woodruff, Alexandra May
Campbell, Janis
Vesely, Sara
Xu, Zheng
Snider, Cuyler
Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title_full Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title_fullStr Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title_full_unstemmed Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title_short Combining Probability and Nonprobability Samples by Using Multivariate Mass Imputation Approaches with Application to Biomedical Research
title_sort combining probability and nonprobability samples by using multivariate mass imputation approaches with application to biomedical research
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10609654/
https://www.ncbi.nlm.nih.gov/pubmed/37901444
http://dx.doi.org/10.3390/stats6020039
work_keys_str_mv AT chensixia combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch
AT woodruffalexandramay combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch
AT campbelljanis combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch
AT veselysara combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch
AT xuzheng combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch
AT snidercuyler combiningprobabilityandnonprobabilitysamplesbyusingmultivariatemassimputationapproacheswithapplicationtobiomedicalresearch