Cargando…

A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection

BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In c...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Haiying, Quandt, Sara A., Grzywacz, Joseph G., Arcury, Thomas A.
Formato: Texto
Lenguaje:English
Publicado: National Institute of Environmental Health Sciences 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3059998/
https://www.ncbi.nlm.nih.gov/pubmed/21097385
http://dx.doi.org/10.1289/ehp.1002124
_version_ 1782200477673324544
author Chen, Haiying
Quandt, Sara A.
Grzywacz, Joseph G.
Arcury, Thomas A.
author_facet Chen, Haiying
Quandt, Sara A.
Grzywacz, Joseph G.
Arcury, Thomas A.
author_sort Chen, Haiying
collection PubMed
description BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors. OBJECTIVE: In this article we expand distribution-based MI methods for left-censored data to a bivariate setting, specifically, a longitudinal study with biological measures at two points in time. METHODS: We have presented the likelihood function for a bivariate normal distribution taking into account values < LOD as well as missing data assumed missing at random, and we use the estimated distributional parameters to impute values < LOD and to generate multiple plausible data sets for analysis by standard statistical methods. We conducted a simulation study to evaluate the sampling properties of the estimators, and we illustrate a practical application using data from the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to estimate associations between urinary acephate (APE) concentrations (indicating pesticide exposure) at two points in time and self-reported symptoms. RESULTS: Simulation study results demonstrated that imputed and observed values together were consistent with the assumed and estimated underlying distribution. Our analysis of PACE3 data using MI to impute APE values < LOD showed that urinary APE concentration was significantly associated with potential pesticide poisoning symptoms. Results based on simple substitution methods were substantially different from those based on the MI method. CONCLUSIONS: The distribution-based MI method is a valid and feasible approach to analyze bivariate data with values < LOD, especially when explicit values for the nondetections are needed. We recommend the use of this approach in environmental and biomedical research.
format Text
id pubmed-3059998
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher National Institute of Environmental Health Sciences
record_format MEDLINE/PubMed
spelling pubmed-30599982011-03-21 A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection Chen, Haiying Quandt, Sara A. Grzywacz, Joseph G. Arcury, Thomas A. Environ Health Perspect Research BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors. OBJECTIVE: In this article we expand distribution-based MI methods for left-censored data to a bivariate setting, specifically, a longitudinal study with biological measures at two points in time. METHODS: We have presented the likelihood function for a bivariate normal distribution taking into account values < LOD as well as missing data assumed missing at random, and we use the estimated distributional parameters to impute values < LOD and to generate multiple plausible data sets for analysis by standard statistical methods. We conducted a simulation study to evaluate the sampling properties of the estimators, and we illustrate a practical application using data from the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to estimate associations between urinary acephate (APE) concentrations (indicating pesticide exposure) at two points in time and self-reported symptoms. RESULTS: Simulation study results demonstrated that imputed and observed values together were consistent with the assumed and estimated underlying distribution. Our analysis of PACE3 data using MI to impute APE values < LOD showed that urinary APE concentration was significantly associated with potential pesticide poisoning symptoms. Results based on simple substitution methods were substantially different from those based on the MI method. CONCLUSIONS: The distribution-based MI method is a valid and feasible approach to analyze bivariate data with values < LOD, especially when explicit values for the nondetections are needed. We recommend the use of this approach in environmental and biomedical research. National Institute of Environmental Health Sciences 2011-03 2010-11-19 /pmc/articles/PMC3059998/ /pubmed/21097385 http://dx.doi.org/10.1289/ehp.1002124 Text en http://creativecommons.org/publicdomain/mark/1.0/ Publication of EHP lies in the public domain and is therefore without copyright. All text from EHP may be reprinted freely. Use of materials published in EHP should be acknowledged (for example, ?Reproduced with permission from Environmental Health Perspectives?); pertinent reference information should be provided for the article from which the material was reproduced. Articles from EHP, especially the News section, may contain photographs or illustrations copyrighted by other commercial organizations or individuals that may not be used without obtaining prior approval from the holder of the copyright.
spellingShingle Research
Chen, Haiying
Quandt, Sara A.
Grzywacz, Joseph G.
Arcury, Thomas A.
A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title_full A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title_fullStr A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title_full_unstemmed A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title_short A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
title_sort distribution-based multiple imputation method for handling bivariate pesticide data with values below the limit of detection
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3059998/
https://www.ncbi.nlm.nih.gov/pubmed/21097385
http://dx.doi.org/10.1289/ehp.1002124
work_keys_str_mv AT chenhaiying adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT quandtsaraa adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT grzywaczjosephg adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT arcurythomasa adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT chenhaiying distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT quandtsaraa distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT grzywaczjosephg distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection
AT arcurythomasa distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection