Cargando…

Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies

Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For examp...

Descripción completa

Detalles Bibliográficos
Autor principal: Huang, Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915610/
https://www.ncbi.nlm.nih.gov/pubmed/26883772
http://dx.doi.org/10.1093/biostatistics/kxw003
_version_ 1782438710463168512
author Huang, Ying
author_facet Huang, Ying
author_sort Huang, Ying
collection PubMed
description Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For example, controls are often frequency-matched to cases with respect to other covariates. Ignoring biased sampling of cases and/or controls can lead to biased inference regarding biomarkers' classification accuracy. Considering the problems of estimating and comparing the area under the receiver operating characteristics curve (AUC) for a binary disease outcome, the impact of biased sampling of cases and/or controls on inference and the strategy to efficiently account for the sampling scheme have not been well studied. In this project, we investigate the inverse-probability-weighted method to adjust for biased sampling in estimating and comparing AUC. Asymptotic properties of the estimator and its inference procedure are developed for both Bernoulli sampling and finite-population stratified sampling. In simulation studies, the weighted estimators provide valid inference for estimation and hypothesis testing, while the standard empirical estimators can generate invalid inference. We demonstrate the use of the analytical variance formula for optimizing sampling schemes in biomarker study design and the application of the proposed AUC estimators to examples in HIV vaccine research and prostate cancer research.
format Online
Article
Text
id pubmed-4915610
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49156102016-06-22 Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies Huang, Ying Biostatistics Articles Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For example, controls are often frequency-matched to cases with respect to other covariates. Ignoring biased sampling of cases and/or controls can lead to biased inference regarding biomarkers' classification accuracy. Considering the problems of estimating and comparing the area under the receiver operating characteristics curve (AUC) for a binary disease outcome, the impact of biased sampling of cases and/or controls on inference and the strategy to efficiently account for the sampling scheme have not been well studied. In this project, we investigate the inverse-probability-weighted method to adjust for biased sampling in estimating and comparing AUC. Asymptotic properties of the estimator and its inference procedure are developed for both Bernoulli sampling and finite-population stratified sampling. In simulation studies, the weighted estimators provide valid inference for estimation and hypothesis testing, while the standard empirical estimators can generate invalid inference. We demonstrate the use of the analytical variance formula for optimizing sampling schemes in biomarker study design and the application of the proposed AUC estimators to examples in HIV vaccine research and prostate cancer research. Oxford University Press 2016-07 2016-02-16 /pmc/articles/PMC4915610/ /pubmed/26883772 http://dx.doi.org/10.1093/biostatistics/kxw003 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Huang, Ying
Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title_full Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title_fullStr Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title_full_unstemmed Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title_short Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
title_sort evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915610/
https://www.ncbi.nlm.nih.gov/pubmed/26883772
http://dx.doi.org/10.1093/biostatistics/kxw003
work_keys_str_mv AT huangying evaluatingandcomparingbiomarkerswithrespecttotheareaunderthereceiveroperatingcharacteristicscurveintwophasecasecontrolstudies