Cargando…

Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes

The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the...

Descripción completa

Detalles Bibliográficos
Autores principales: Park, Chanwoo, Jiang, Nan, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944048/
https://www.ncbi.nlm.nih.gov/pubmed/31896247
http://dx.doi.org/10.5808/GI.2019.17.4.e47
_version_ 1783484987109015552
author Park, Chanwoo
Jiang, Nan
Park, Taesung
author_facet Park, Chanwoo
Jiang, Nan
Park, Taesung
author_sort Park, Chanwoo
collection PubMed
description The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the pure additive contribution of genetic variants to classically used demographic models. Since prediction models include some heritable traits, such as body mass index, the contribution of SNPs using unmatched case-control samples may be underestimated. In this article, we propose a method that uses propensity score matching to avoid underestimation by matching case and control samples, thereby determining the pure additive contribution of SNPs. To illustrate the proposed propensity score matching method, we used SNP data from the Korea Association Resources project and reported SNPs from the genome-wide association study catalog. We selected various SNP sets via stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and the elastic-net (EN) algorithm. Using these SNP sets, we made predictions using SLR, LASSO, and EN as logistic regression modeling techniques. The accuracy of the predictions was compared in terms of area under the receiver operating characteristic curve (AUC). The contribution of SNPs to T2D was evaluated by the difference in the AUC between models using only demographic variables and models that included the SNPs. The largest difference among our models showed that the AUC of the model using genetic variants with demographic variables could be 0.107 higher than that of the corresponding model using only demographic variables.
format Online
Article
Text
id pubmed-6944048
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-69440482020-01-09 Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes Park, Chanwoo Jiang, Nan Park, Taesung Genomics Inform Original Article The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the pure additive contribution of genetic variants to classically used demographic models. Since prediction models include some heritable traits, such as body mass index, the contribution of SNPs using unmatched case-control samples may be underestimated. In this article, we propose a method that uses propensity score matching to avoid underestimation by matching case and control samples, thereby determining the pure additive contribution of SNPs. To illustrate the proposed propensity score matching method, we used SNP data from the Korea Association Resources project and reported SNPs from the genome-wide association study catalog. We selected various SNP sets via stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and the elastic-net (EN) algorithm. Using these SNP sets, we made predictions using SLR, LASSO, and EN as logistic regression modeling techniques. The accuracy of the predictions was compared in terms of area under the receiver operating characteristic curve (AUC). The contribution of SNPs to T2D was evaluated by the difference in the AUC between models using only demographic variables and models that included the SNPs. The largest difference among our models showed that the AUC of the model using genetic variants with demographic variables could be 0.107 higher than that of the corresponding model using only demographic variables. Korea Genome Organization 2019-12-23 /pmc/articles/PMC6944048/ /pubmed/31896247 http://dx.doi.org/10.5808/GI.2019.17.4.e47 Text en (c) 2019, Korea Genome Organization (CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Park, Chanwoo
Jiang, Nan
Park, Taesung
Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title_full Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title_fullStr Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title_full_unstemmed Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title_short Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
title_sort pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944048/
https://www.ncbi.nlm.nih.gov/pubmed/31896247
http://dx.doi.org/10.5808/GI.2019.17.4.e47
work_keys_str_mv AT parkchanwoo pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes
AT jiangnan pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes
AT parktaesung pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes