Cargando…
Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes
The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korea Genome Organization
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944048/ https://www.ncbi.nlm.nih.gov/pubmed/31896247 http://dx.doi.org/10.5808/GI.2019.17.4.e47 |
_version_ | 1783484987109015552 |
---|---|
author | Park, Chanwoo Jiang, Nan Park, Taesung |
author_facet | Park, Chanwoo Jiang, Nan Park, Taesung |
author_sort | Park, Chanwoo |
collection | PubMed |
description | The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the pure additive contribution of genetic variants to classically used demographic models. Since prediction models include some heritable traits, such as body mass index, the contribution of SNPs using unmatched case-control samples may be underestimated. In this article, we propose a method that uses propensity score matching to avoid underestimation by matching case and control samples, thereby determining the pure additive contribution of SNPs. To illustrate the proposed propensity score matching method, we used SNP data from the Korea Association Resources project and reported SNPs from the genome-wide association study catalog. We selected various SNP sets via stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and the elastic-net (EN) algorithm. Using these SNP sets, we made predictions using SLR, LASSO, and EN as logistic regression modeling techniques. The accuracy of the predictions was compared in terms of area under the receiver operating characteristic curve (AUC). The contribution of SNPs to T2D was evaluated by the difference in the AUC between models using only demographic variables and models that included the SNPs. The largest difference among our models showed that the AUC of the model using genetic variants with demographic variables could be 0.107 higher than that of the corresponding model using only demographic variables. |
format | Online Article Text |
id | pubmed-6944048 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Korea Genome Organization |
record_format | MEDLINE/PubMed |
spelling | pubmed-69440482020-01-09 Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes Park, Chanwoo Jiang, Nan Park, Taesung Genomics Inform Original Article The achievements of genome-wide association studies have suggested ways to predict diseases, such as type 2 diabetes (T2D), using single-nucleotide polymorphisms (SNPs). Most T2D risk prediction models have used SNPs in combination with demographic variables. However, it is difficult to evaluate the pure additive contribution of genetic variants to classically used demographic models. Since prediction models include some heritable traits, such as body mass index, the contribution of SNPs using unmatched case-control samples may be underestimated. In this article, we propose a method that uses propensity score matching to avoid underestimation by matching case and control samples, thereby determining the pure additive contribution of SNPs. To illustrate the proposed propensity score matching method, we used SNP data from the Korea Association Resources project and reported SNPs from the genome-wide association study catalog. We selected various SNP sets via stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and the elastic-net (EN) algorithm. Using these SNP sets, we made predictions using SLR, LASSO, and EN as logistic regression modeling techniques. The accuracy of the predictions was compared in terms of area under the receiver operating characteristic curve (AUC). The contribution of SNPs to T2D was evaluated by the difference in the AUC between models using only demographic variables and models that included the SNPs. The largest difference among our models showed that the AUC of the model using genetic variants with demographic variables could be 0.107 higher than that of the corresponding model using only demographic variables. Korea Genome Organization 2019-12-23 /pmc/articles/PMC6944048/ /pubmed/31896247 http://dx.doi.org/10.5808/GI.2019.17.4.e47 Text en (c) 2019, Korea Genome Organization (CC) This is an open-access article distributed under the terms of the Creative Commons Attribution license(https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Park, Chanwoo Jiang, Nan Park, Taesung Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title | Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title_full | Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title_fullStr | Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title_full_unstemmed | Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title_short | Pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
title_sort | pure additive contribution of genetic variants to a risk prediction model using propensity score matching: application to type 2 diabetes |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6944048/ https://www.ncbi.nlm.nih.gov/pubmed/31896247 http://dx.doi.org/10.5808/GI.2019.17.4.e47 |
work_keys_str_mv | AT parkchanwoo pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes AT jiangnan pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes AT parktaesung pureadditivecontributionofgeneticvariantstoariskpredictionmodelusingpropensityscorematchingapplicationtotype2diabetes |