Cargando…

The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling

Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determini...

Descripción completa

Detalles Bibliográficos
Autores principales: Wray, Naomi R., Yang, Jian, Goddard, Michael E., Visscher, Peter M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829056/
https://www.ncbi.nlm.nih.gov/pubmed/20195508
http://dx.doi.org/10.1371/journal.pgen.1000864
_version_ 1782178069752053760
author Wray, Naomi R.
Yang, Jian
Goddard, Michael E.
Visscher, Peter M.
author_facet Wray, Naomi R.
Yang, Jian
Goddard, Michael E.
Visscher, Peter M.
author_sort Wray, Naomi R.
collection PubMed
description Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determining the efficacy of tests in correctly classifying diseased and non-diseased individuals. We use quantitative genetics theory to provide insight into the genetic interpretation of the area under the ROC curve (AUC) when the test classifier is a predictor of genetic risk. Even when the proportion of genetic variance explained by the test is 100%, there is a maximum value for AUC that depends on the genetic epidemiology of the disease, i.e. either the sibling recurrence risk or heritability and disease prevalence. We derive an equation relating maximum AUC to heritability and disease prevalence. The expression can be reversed to calculate the proportion of genetic variance explained given AUC, disease prevalence, and heritability. We use published estimates of disease prevalence and sibling recurrence risk for 17 complex genetic diseases to calculate the proportion of genetic variance that a test must explain to achieve AUC = 0.75; this varied from 0.10 to 0.74. We provide a genetic interpretation of AUC for use with predictors of genetic risk based on genomic profiles. We provide a strategy to estimate proportion of genetic variance explained on the liability scale from estimates of AUC, disease prevalence, and heritability (or sibling recurrence risk) available as an online calculator.
format Text
id pubmed-2829056
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28290562010-03-02 The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling Wray, Naomi R. Yang, Jian Goddard, Michael E. Visscher, Peter M. PLoS Genet Research Article Genome-wide association studies in human populations have facilitated the creation of genomic profiles which combine the effects of many associated genetic variants to predict risk of disease. The area under the receiver operator characteristic (ROC) curve is a well established measure for determining the efficacy of tests in correctly classifying diseased and non-diseased individuals. We use quantitative genetics theory to provide insight into the genetic interpretation of the area under the ROC curve (AUC) when the test classifier is a predictor of genetic risk. Even when the proportion of genetic variance explained by the test is 100%, there is a maximum value for AUC that depends on the genetic epidemiology of the disease, i.e. either the sibling recurrence risk or heritability and disease prevalence. We derive an equation relating maximum AUC to heritability and disease prevalence. The expression can be reversed to calculate the proportion of genetic variance explained given AUC, disease prevalence, and heritability. We use published estimates of disease prevalence and sibling recurrence risk for 17 complex genetic diseases to calculate the proportion of genetic variance that a test must explain to achieve AUC = 0.75; this varied from 0.10 to 0.74. We provide a genetic interpretation of AUC for use with predictors of genetic risk based on genomic profiles. We provide a strategy to estimate proportion of genetic variance explained on the liability scale from estimates of AUC, disease prevalence, and heritability (or sibling recurrence risk) available as an online calculator. Public Library of Science 2010-02-26 /pmc/articles/PMC2829056/ /pubmed/20195508 http://dx.doi.org/10.1371/journal.pgen.1000864 Text en Wray et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Wray, Naomi R.
Yang, Jian
Goddard, Michael E.
Visscher, Peter M.
The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title_full The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title_fullStr The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title_full_unstemmed The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title_short The Genetic Interpretation of Area under the ROC Curve in Genomic Profiling
title_sort genetic interpretation of area under the roc curve in genomic profiling
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2829056/
https://www.ncbi.nlm.nih.gov/pubmed/20195508
http://dx.doi.org/10.1371/journal.pgen.1000864
work_keys_str_mv AT wraynaomir thegeneticinterpretationofareaundertheroccurveingenomicprofiling
AT yangjian thegeneticinterpretationofareaundertheroccurveingenomicprofiling
AT goddardmichaele thegeneticinterpretationofareaundertheroccurveingenomicprofiling
AT visscherpeterm thegeneticinterpretationofareaundertheroccurveingenomicprofiling
AT wraynaomir geneticinterpretationofareaundertheroccurveingenomicprofiling
AT yangjian geneticinterpretationofareaundertheroccurveingenomicprofiling
AT goddardmichaele geneticinterpretationofareaundertheroccurveingenomicprofiling
AT visscherpeterm geneticinterpretationofareaundertheroccurveingenomicprofiling