Cargando…

How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches

Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of...

Descripción completa

Detalles Bibliográficos
Autores principales: Herbers, Judith, Miller, Robert, Walther, Andreas, Schindler, Lena, Schmidt, Kornelius, Gao, Wei, Rupprecht, Florian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216349/
https://www.ncbi.nlm.nih.gov/pubmed/35757062
http://dx.doi.org/10.1016/j.cpnec.2021.100052
_version_ 1784731400697544704
author Herbers, Judith
Miller, Robert
Walther, Andreas
Schindler, Lena
Schmidt, Kornelius
Gao, Wei
Rupprecht, Florian
author_facet Herbers, Judith
Miller, Robert
Walther, Andreas
Schindler, Lena
Schmidt, Kornelius
Gao, Wei
Rupprecht, Florian
author_sort Herbers, Judith
collection PubMed
description Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of psychoneuroendocrinology, and the statistical bias in some of the applied methods may compromise the robustness, comparability, and generalizability of research findings. In this paper, we describe the occurrence of ND and OV in terms of a model that considers them as censored data, for instance due to measurement error cutoffs. We then present common univariate approaches in handling ND and OV by highlighting their respective strengths and drawbacks. In a simulation study with lognormal distributed data, we compare the performance of six selected methods, ranging from simple and commonly used to more sophisticated imputation procedures, in four scenarios with varying patterns of censored values as well as for a broad range of cutoffs. Especially deletion, but also fixed-value imputations bear a high risk of biased and pseudo-precise parameter estimates. We also introduce censored regressions as a more sophisticated option for a direct modeling of the censored data. Our analyses demonstrate the impact of ND and OV handling methods on the results of biomarker-oriented research, supporting the need for transparent reporting and the implementation of best practices. In our simulations, the use of imputed data from the censored intervals of a fitted lognormal distribution shows preferable properties regarding our established criteria. We provide the algorithm for this favored routine for a direct application in R on the Open Science Framework (https://osf.io/spgtv). Further research is needed to evaluate the performance of the algorithm in various contexts, for example when the underlying assumptions do not hold. We conclude with recommendations and potential further improvements for the field.
format Online
Article
Text
id pubmed-9216349
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-92163492022-06-24 How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches Herbers, Judith Miller, Robert Walther, Andreas Schindler, Lena Schmidt, Kornelius Gao, Wei Rupprecht, Florian Compr Psychoneuroendocrinol Review Non-detectable (ND) and outlying concentration values (OV) are a common challenge of biomarker investigations. However, best practices on how to aptly deal with the affected cases are still missing. The high methodological heterogeneity in biomarker-oriented research, as for example, in the field of psychoneuroendocrinology, and the statistical bias in some of the applied methods may compromise the robustness, comparability, and generalizability of research findings. In this paper, we describe the occurrence of ND and OV in terms of a model that considers them as censored data, for instance due to measurement error cutoffs. We then present common univariate approaches in handling ND and OV by highlighting their respective strengths and drawbacks. In a simulation study with lognormal distributed data, we compare the performance of six selected methods, ranging from simple and commonly used to more sophisticated imputation procedures, in four scenarios with varying patterns of censored values as well as for a broad range of cutoffs. Especially deletion, but also fixed-value imputations bear a high risk of biased and pseudo-precise parameter estimates. We also introduce censored regressions as a more sophisticated option for a direct modeling of the censored data. Our analyses demonstrate the impact of ND and OV handling methods on the results of biomarker-oriented research, supporting the need for transparent reporting and the implementation of best practices. In our simulations, the use of imputed data from the censored intervals of a fitted lognormal distribution shows preferable properties regarding our established criteria. We provide the algorithm for this favored routine for a direct application in R on the Open Science Framework (https://osf.io/spgtv). Further research is needed to evaluate the performance of the algorithm in various contexts, for example when the underlying assumptions do not hold. We conclude with recommendations and potential further improvements for the field. Elsevier 2021-03-29 /pmc/articles/PMC9216349/ /pubmed/35757062 http://dx.doi.org/10.1016/j.cpnec.2021.100052 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Herbers, Judith
Miller, Robert
Walther, Andreas
Schindler, Lena
Schmidt, Kornelius
Gao, Wei
Rupprecht, Florian
How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title_full How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title_fullStr How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title_full_unstemmed How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title_short How to deal with non-detectable and outlying values in biomarker research: Best practices and recommendations for univariate imputation approaches
title_sort how to deal with non-detectable and outlying values in biomarker research: best practices and recommendations for univariate imputation approaches
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9216349/
https://www.ncbi.nlm.nih.gov/pubmed/35757062
http://dx.doi.org/10.1016/j.cpnec.2021.100052
work_keys_str_mv AT herbersjudith howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT millerrobert howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT waltherandreas howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT schindlerlena howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT schmidtkornelius howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT gaowei howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches
AT rupprechtflorian howtodealwithnondetectableandoutlyingvaluesinbiomarkerresearchbestpracticesandrecommendationsforunivariateimputationapproaches