Cargando…
How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216349/ https://www.ncbi.nlm.nih.gov/pubmed/35757062 http://dx.doi.org/10.1016/j.cpnec.2021.100052 |
_version_ | 1784731400697544704 |
---|---|
author | Herbers, Judith Miller, Robert Walther, Andreas Schindler, Lena Schmidt, Kornelius Gao, Wei Rupprecht, Florian |
author_facet | Herbers, Judith Miller, Robert Walther, Andreas Schindler, Lena Schmidt, Kornelius Gao, Wei Rupprecht, Florian |
author_sort | Herbers, Judith |
collection | PubMed |
description | Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of psychoneuroendocrinology, and the statistical bias in some of the applied methods may compromise the robustness, comparability, and generalizability of research findings. In this paper, we describe the occurrence of ND and OV in terms of a model that considers them as censored data, for instance due to measurement error cutoffs. We then present common univariate approaches in handling ND and OV by highlighting their respective strengths and drawbacks. In a simulation study with lognormal distributed data, we compare the performance of six selected methods, ranging from simple and commonly used to more sophisticated imputation procedures, in four scenarios with varying patterns of censored values as well as for a broad range of cutoffs. Especially deletion, but also fixed-value imputations bear a high risk of biased and pseudo-precise parameter estimates. We also introduce censored regressions as a more sophisticated option for a direct modeling of the censored data. Our analyses demonstrate the impact of ND and OV handling methods on the results of biomarker-oriented research, supporting the need for transparent reporting and the implementation of best practices. In our simulations, the use of imputed data from the censored intervals of a fitted lognormal distribution shows preferable properties regarding our established criteria. We provide the algorithm for this favored routine for a direct application in R on the Open Science Framework (https://osf.io/spgtv). Further research is needed to evaluate the performance of the algorithm in various contexts, for example when the underlying assumptions do not hold. We conclude with recommendations and potential further improvements for the field. |
format | Online Article Text |
id | pubmed-9216349 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-92163492022-06-24 How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches Herbers, Judith Miller, Robert Walther, Andreas Schindler, Lena Schmidt, Kornelius Gao, Wei Rupprecht, Florian Compr Psychoneuroendocrinol Review Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of psychoneuroendocrinology, and the statistical bias in some of the applied methods may compromise the robustness, comparability, and generalizability of research findings. In this paper, we describe the occurrence of ND and OV in terms of a model that considers them as censored data, for instance due to measurement error cutoffs. We then present common univariate approaches in handling ND and OV by highlighting their respective strengths and drawbacks. In a simulation study with lognormal distributed data, we compare the performance of six selected methods, ranging from simple and commonly used to more sophisticated imputation procedures, in four scenarios with varying patterns of censored values as well as for a broad range of cutoffs. Especially deletion, but also fixed-value imputations bear a high risk of biased and pseudo-precise parameter estimates. We also introduce censored regressions as a more sophisticated option for a direct modeling of the censored data. Our analyses demonstrate the impact of ND and OV handling methods on the results of biomarker-oriented research, supporting the need for transparent reporting and the implementation of best practices. In our simulations, the use of imputed data from the censored intervals of a fitted lognormal distribution shows preferable properties regarding our established criteria. We provide the algorithm for this favored routine for a direct application in R on the Open Science Framework (https://osf.io/spgtv). Further research is needed to evaluate the performance of the algorithm in various contexts, for example when the underlying assumptions do not hold. We conclude with recommendations and potential further improvements for the field. Elsevier 2021-03-29 /pmc/articles/PMC9216349/ /pubmed/35757062 http://dx.doi.org/10.1016/j.cpnec.2021.100052 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Review Herbers, Judith Miller, Robert Walther, Andreas Schindler, Lena Schmidt, Kornelius Gao, Wei Rupprecht, Florian How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title | How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title_full | How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title_fullStr | How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title_full_unstemmed | How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title_short | How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches |
title_sort | how to deal with non-detectable and outlying values in biomarker research: best practices and recommendations for univariate imputation approaches |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216349/ https://www.ncbi.nlm.nih.gov/pubmed/35757062 http://dx.doi.org/10.1016/j.cpnec.2021.100052 |
work_keys_str_mv | AT herbersjudith howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT millerrobert howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT waltherandreas howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT schindlerlena howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT schmidtkornelius howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT gaowei howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches AT rupprechtflorian howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches |