Cargando…

Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC

The area under the true ROC curve (AUC) is routinely used to determine how strongly a given model discriminates between the levels of a binary outcome. Standard inference with the AUC requires that outcomes be independent of each other. To overcome this limitation, a method was developed for the est...

Descripción completa

Detalles Bibliográficos
Autores principales: Bay, Camden, Glynn, Robert J, Seddon, Johanna M, Lee, Mei-Ling Ting, Rosner, Bernard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10621602/
https://www.ncbi.nlm.nih.gov/pubmed/37920864
http://dx.doi.org/10.3390/stats6020034
_version_ 1785130402704261120
author Bay, Camden
Glynn, Robert J
Seddon, Johanna M
Lee, Mei-Ling Ting
Rosner, Bernard
author_facet Bay, Camden
Glynn, Robert J
Seddon, Johanna M
Lee, Mei-Ling Ting
Rosner, Bernard
author_sort Bay, Camden
collection PubMed
description The area under the true ROC curve (AUC) is routinely used to determine how strongly a given model discriminates between the levels of a binary outcome. Standard inference with the AUC requires that outcomes be independent of each other. To overcome this limitation, a method was developed for the estimation of the variance of the AUC in the setting of two-level hierarchical data using probit-transformed prediction scores generated from generalized estimating equation models, thereby allowing for the application of inferential methods. This manuscript presents an extension of this approach so that inference for the AUC may be performed in a three-level hierarchical data setting (e.g., eyes nested within persons and persons nested within families). A method that accounts for the effect of tied prediction scores on inference is also described. The performance of 95% confidence intervals around the AUC was assessed through the simulation of three-level clustered data in multiple settings, including ones with tied data and variable cluster sizes. Across all settings, the actual 95% confidence interval coverage varied from 0.943 to 0.958, and the ratio of the theoretical variance to the empirical variance of the AUC varied from 0.920 to 1.013. The results are better than those from existing methods. Two examples of applying the proposed methodology are presented.
format Online
Article
Text
id pubmed-10621602
institution National Center for Biotechnology Information
language English
publishDate 2023
record_format MEDLINE/PubMed
spelling pubmed-106216022023-11-02 Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC Bay, Camden Glynn, Robert J Seddon, Johanna M Lee, Mei-Ling Ting Rosner, Bernard Stats (Basel) Article The area under the true ROC curve (AUC) is routinely used to determine how strongly a given model discriminates between the levels of a binary outcome. Standard inference with the AUC requires that outcomes be independent of each other. To overcome this limitation, a method was developed for the estimation of the variance of the AUC in the setting of two-level hierarchical data using probit-transformed prediction scores generated from generalized estimating equation models, thereby allowing for the application of inferential methods. This manuscript presents an extension of this approach so that inference for the AUC may be performed in a three-level hierarchical data setting (e.g., eyes nested within persons and persons nested within families). A method that accounts for the effect of tied prediction scores on inference is also described. The performance of 95% confidence intervals around the AUC was assessed through the simulation of three-level clustered data in multiple settings, including ones with tied data and variable cluster sizes. Across all settings, the actual 95% confidence interval coverage varied from 0.943 to 0.958, and the ratio of the theoretical variance to the empirical variance of the AUC varied from 0.920 to 1.013. The results are better than those from existing methods. Two examples of applying the proposed methodology are presented. 2023-06 2023-04-24 /pmc/articles/PMC10621602/ /pubmed/37920864 http://dx.doi.org/10.3390/stats6020034 Text en https://creativecommons.org/licenses/by/4.0/Submitted for possible open access publication under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bay, Camden
Glynn, Robert J
Seddon, Johanna M
Lee, Mei-Ling Ting
Rosner, Bernard
Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title_full Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title_fullStr Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title_full_unstemmed Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title_short Evaluation of Risk Prediction with Hierarchical Data: Dependency Adjusted Confidence Intervals for the AUC
title_sort evaluation of risk prediction with hierarchical data: dependency adjusted confidence intervals for the auc
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10621602/
https://www.ncbi.nlm.nih.gov/pubmed/37920864
http://dx.doi.org/10.3390/stats6020034
work_keys_str_mv AT baycamden evaluationofriskpredictionwithhierarchicaldatadependencyadjustedconfidenceintervalsfortheauc
AT glynnrobertj evaluationofriskpredictionwithhierarchicaldatadependencyadjustedconfidenceintervalsfortheauc
AT seddonjohannam evaluationofriskpredictionwithhierarchicaldatadependencyadjustedconfidenceintervalsfortheauc
AT leemeilingting evaluationofriskpredictionwithhierarchicaldatadependencyadjustedconfidenceintervalsfortheauc
AT rosnerbernard evaluationofriskpredictionwithhierarchicaldatadependencyadjustedconfidenceintervalsfortheauc