Cargando…
A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection
BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In c...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
National Institute of Environmental Health Sciences
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3059998/ https://www.ncbi.nlm.nih.gov/pubmed/21097385 http://dx.doi.org/10.1289/ehp.1002124 |
_version_ | 1782200477673324544 |
---|---|
author | Chen, Haiying Quandt, Sara A. Grzywacz, Joseph G. Arcury, Thomas A. |
author_facet | Chen, Haiying Quandt, Sara A. Grzywacz, Joseph G. Arcury, Thomas A. |
author_sort | Chen, Haiying |
collection | PubMed |
description | BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors. OBJECTIVE: In this article we expand distribution-based MI methods for left-censored data to a bivariate setting, specifically, a longitudinal study with biological measures at two points in time. METHODS: We have presented the likelihood function for a bivariate normal distribution taking into account values < LOD as well as missing data assumed missing at random, and we use the estimated distributional parameters to impute values < LOD and to generate multiple plausible data sets for analysis by standard statistical methods. We conducted a simulation study to evaluate the sampling properties of the estimators, and we illustrate a practical application using data from the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to estimate associations between urinary acephate (APE) concentrations (indicating pesticide exposure) at two points in time and self-reported symptoms. RESULTS: Simulation study results demonstrated that imputed and observed values together were consistent with the assumed and estimated underlying distribution. Our analysis of PACE3 data using MI to impute APE values < LOD showed that urinary APE concentration was significantly associated with potential pesticide poisoning symptoms. Results based on simple substitution methods were substantially different from those based on the MI method. CONCLUSIONS: The distribution-based MI method is a valid and feasible approach to analyze bivariate data with values < LOD, especially when explicit values for the nondetections are needed. We recommend the use of this approach in environmental and biomedical research. |
format | Text |
id | pubmed-3059998 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | National Institute of Environmental Health Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-30599982011-03-21 A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection Chen, Haiying Quandt, Sara A. Grzywacz, Joseph G. Arcury, Thomas A. Environ Health Perspect Research BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors. OBJECTIVE: In this article we expand distribution-based MI methods for left-censored data to a bivariate setting, specifically, a longitudinal study with biological measures at two points in time. METHODS: We have presented the likelihood function for a bivariate normal distribution taking into account values < LOD as well as missing data assumed missing at random, and we use the estimated distributional parameters to impute values < LOD and to generate multiple plausible data sets for analysis by standard statistical methods. We conducted a simulation study to evaluate the sampling properties of the estimators, and we illustrate a practical application using data from the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to estimate associations between urinary acephate (APE) concentrations (indicating pesticide exposure) at two points in time and self-reported symptoms. RESULTS: Simulation study results demonstrated that imputed and observed values together were consistent with the assumed and estimated underlying distribution. Our analysis of PACE3 data using MI to impute APE values < LOD showed that urinary APE concentration was significantly associated with potential pesticide poisoning symptoms. Results based on simple substitution methods were substantially different from those based on the MI method. CONCLUSIONS: The distribution-based MI method is a valid and feasible approach to analyze bivariate data with values < LOD, especially when explicit values for the nondetections are needed. We recommend the use of this approach in environmental and biomedical research. National Institute of Environmental Health Sciences 2011-03 2010-11-19 /pmc/articles/PMC3059998/ /pubmed/21097385 http://dx.doi.org/10.1289/ehp.1002124 Text en http://creativecommons.org/publicdomain/mark/1.0/ Publication of EHP lies in the public domain and is therefore without copyright. All text from EHP may be reprinted freely. Use of materials published in EHP should be acknowledged (for example, ?Reproduced with permission from Environmental Health Perspectives?); pertinent reference information should be provided for the article from which the material was reproduced. Articles from EHP, especially the News section, may contain photographs or illustrations copyrighted by other commercial organizations or individuals that may not be used without obtaining prior approval from the holder of the copyright. |
spellingShingle | Research Chen, Haiying Quandt, Sara A. Grzywacz, Joseph G. Arcury, Thomas A. A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title | A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title_full | A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title_fullStr | A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title_full_unstemmed | A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title_short | A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection |
title_sort | distribution-based multiple imputation method for handling bivariate pesticide data with values below the limit of detection |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3059998/ https://www.ncbi.nlm.nih.gov/pubmed/21097385 http://dx.doi.org/10.1289/ehp.1002124 |
work_keys_str_mv | AT chenhaiying adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT quandtsaraa adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT grzywaczjosephg adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT arcurythomasa adistributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT chenhaiying distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT quandtsaraa distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT grzywaczjosephg distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection AT arcurythomasa distributionbasedmultipleimputationmethodforhandlingbivariatepesticidedatawithvaluesbelowthelimitofdetection |