Cargando…

Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk

BACKGROUND: Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlyi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kundu, Suman, Kers, Jannigje G., Janssens, A. Cecile J. W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4811433/
https://www.ncbi.nlm.nih.gov/pubmed/27023073
http://dx.doi.org/10.1371/journal.pone.0152359
_version_ 1782423956233388032
author Kundu, Suman
Kers, Jannigje G.
Janssens, A. Cecile J. W.
author_facet Kundu, Suman
Kers, Jannigje G.
Janssens, A. Cecile J. W.
author_sort Kundu, Suman
collection PubMed
description BACKGROUND: Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlying a method that simulates risk data for specific values of the area under the receiver operating characteristic curve (AUC). METHODS: The simulation method constructs risk data for a hypothetical population based on the population disease risk, and the odds ratios and frequencies of genetic variants. By systematically varying the parameters, we investigated under what conditions AUC values represent unique ROC curves with unique risk distributions for patients and nonpatients, and to what extend risk data can be simulated for precise values of the AUC. RESULTS: Using larger number of genetic variants each with a modest effect, we observed that the distributions of estimated risks of patients and nonpatients were similar for various combinations of the odds ratios and frequencies of the risk alleles. Simulated ROC curves overlapped empirical curves with the same AUC. CONCLUSIONS: Polygenic risk data can be effectively and efficiently created using a simulation method. This allows to further investigate the potential applications of stratifying interventions on the basis of polygenic risk.
format Online
Article
Text
id pubmed-4811433
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48114332016-04-05 Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk Kundu, Suman Kers, Jannigje G. Janssens, A. Cecile J. W. PLoS One Research Article BACKGROUND: Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlying a method that simulates risk data for specific values of the area under the receiver operating characteristic curve (AUC). METHODS: The simulation method constructs risk data for a hypothetical population based on the population disease risk, and the odds ratios and frequencies of genetic variants. By systematically varying the parameters, we investigated under what conditions AUC values represent unique ROC curves with unique risk distributions for patients and nonpatients, and to what extend risk data can be simulated for precise values of the AUC. RESULTS: Using larger number of genetic variants each with a modest effect, we observed that the distributions of estimated risks of patients and nonpatients were similar for various combinations of the odds ratios and frequencies of the risk alleles. Simulated ROC curves overlapped empirical curves with the same AUC. CONCLUSIONS: Polygenic risk data can be effectively and efficiently created using a simulation method. This allows to further investigate the potential applications of stratifying interventions on the basis of polygenic risk. Public Library of Science 2016-03-29 /pmc/articles/PMC4811433/ /pubmed/27023073 http://dx.doi.org/10.1371/journal.pone.0152359 Text en © 2016 Kundu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kundu, Suman
Kers, Jannigje G.
Janssens, A. Cecile J. W.
Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title_full Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title_fullStr Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title_full_unstemmed Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title_short Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk
title_sort constructing hypothetical risk data from the area under the roc curve: modelling distributions of polygenic risk
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4811433/
https://www.ncbi.nlm.nih.gov/pubmed/27023073
http://dx.doi.org/10.1371/journal.pone.0152359
work_keys_str_mv AT kundusuman constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk
AT kersjannigjeg constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk
AT janssensacecilejw constructinghypotheticalriskdatafromtheareaundertheroccurvemodellingdistributionsofpolygenicrisk