Cargando…
Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies
Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For examp...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915610/ https://www.ncbi.nlm.nih.gov/pubmed/26883772 http://dx.doi.org/10.1093/biostatistics/kxw003 |
_version_ | 1782438710463168512 |
---|---|
author | Huang, Ying |
author_facet | Huang, Ying |
author_sort | Huang, Ying |
collection | PubMed |
description | Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For example, controls are often frequency-matched to cases with respect to other covariates. Ignoring biased sampling of cases and/or controls can lead to biased inference regarding biomarkers' classification accuracy. Considering the problems of estimating and comparing the area under the receiver operating characteristics curve (AUC) for a binary disease outcome, the impact of biased sampling of cases and/or controls on inference and the strategy to efficiently account for the sampling scheme have not been well studied. In this project, we investigate the inverse-probability-weighted method to adjust for biased sampling in estimating and comparing AUC. Asymptotic properties of the estimator and its inference procedure are developed for both Bernoulli sampling and finite-population stratified sampling. In simulation studies, the weighted estimators provide valid inference for estimation and hypothesis testing, while the standard empirical estimators can generate invalid inference. We demonstrate the use of the analytical variance formula for optimizing sampling schemes in biomarker study design and the application of the proposed AUC estimators to examples in HIV vaccine research and prostate cancer research. |
format | Online Article Text |
id | pubmed-4915610 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-49156102016-06-22 Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies Huang, Ying Biostatistics Articles Two-phase sampling design, where biomarkers are subsampled from a phase-one cohort sample representative of the target population, has become the gold standard in biomarker evaluation. Many two-phase case–control studies involve biased sampling of cases and/or controls in the second phase. For example, controls are often frequency-matched to cases with respect to other covariates. Ignoring biased sampling of cases and/or controls can lead to biased inference regarding biomarkers' classification accuracy. Considering the problems of estimating and comparing the area under the receiver operating characteristics curve (AUC) for a binary disease outcome, the impact of biased sampling of cases and/or controls on inference and the strategy to efficiently account for the sampling scheme have not been well studied. In this project, we investigate the inverse-probability-weighted method to adjust for biased sampling in estimating and comparing AUC. Asymptotic properties of the estimator and its inference procedure are developed for both Bernoulli sampling and finite-population stratified sampling. In simulation studies, the weighted estimators provide valid inference for estimation and hypothesis testing, while the standard empirical estimators can generate invalid inference. We demonstrate the use of the analytical variance formula for optimizing sampling schemes in biomarker study design and the application of the proposed AUC estimators to examples in HIV vaccine research and prostate cancer research. Oxford University Press 2016-07 2016-02-16 /pmc/articles/PMC4915610/ /pubmed/26883772 http://dx.doi.org/10.1093/biostatistics/kxw003 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Huang, Ying Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title | Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title_full | Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title_fullStr | Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title_full_unstemmed | Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title_short | Evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
title_sort | evaluating and comparing biomarkers with respect to the area under the receiver operating characteristics curve in two-phase case–control studies |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915610/ https://www.ncbi.nlm.nih.gov/pubmed/26883772 http://dx.doi.org/10.1093/biostatistics/kxw003 |
work_keys_str_mv | AT huangying evaluatingandcomparingbiomarkerswithrespecttotheareaunderthereceiveroperatingcharacteristicscurveintwophasecasecontrolstudies |