Cargando…

Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes

The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Sungkyoung, Bae, Sunghwan, Park, Taesung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5287117/
https://www.ncbi.nlm.nih.gov/pubmed/28154504
http://dx.doi.org/10.5808/GI.2016.14.4.138
_version_ 1782504108241977344
author Choi, Sungkyoung
Bae, Sunghwan
Park, Taesung
author_facet Choi, Sungkyoung
Bae, Sunghwan
Park, Taesung
author_sort Choi, Sungkyoung
collection PubMed
description The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes.
format Online
Article
Text
id pubmed-5287117
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-52871172017-02-02 Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes Choi, Sungkyoung Bae, Sunghwan Park, Taesung Genomics Inform Original Article The success of genome-wide association studies (GWASs) has enabled us to improve risk assessment and provide novel genetic variants for diagnosis, prevention, and treatment. However, most variants discovered by GWASs have been reported to have very small effect sizes on complex human diseases, which has been a big hurdle in building risk prediction models. Recently, many statistical approaches based on penalized regression have been developed to solve the “large p and small n” problem. In this report, we evaluated the performance of several statistical methods for predicting a binary trait: stepwise logistic regression (SLR), least absolute shrinkage and selection operator (LASSO), and Elastic-Net (EN). We first built a prediction model by combining variable selection and prediction methods for type 2 diabetes using Affymetrix Genome-Wide Human SNP Array 5.0 from the Korean Association Resource project. We assessed the risk prediction performance using area under the receiver operating characteristic curve (AUC) for the internal and external validation datasets. In the internal validation, SLR-LASSO and SLR-EN tended to yield more accurate predictions than other combinations. During the external validation, the SLR-SLR and SLR-EN combinations achieved the highest AUC of 0.726. We propose these combinations as a potentially powerful risk prediction model for type 2 diabetes. Korea Genome Organization 2016-12 2016-12-30 /pmc/articles/PMC5287117/ /pubmed/28154504 http://dx.doi.org/10.5808/GI.2016.14.4.138 Text en Copyright © 2016 by the Korea Genome Organization http://creativecommons.org/licenses/by-nc/4.0/ It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/).
spellingShingle Original Article
Choi, Sungkyoung
Bae, Sunghwan
Park, Taesung
Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title_full Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title_fullStr Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title_full_unstemmed Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title_short Risk Prediction Using Genome-Wide Association Studies on Type 2 Diabetes
title_sort risk prediction using genome-wide association studies on type 2 diabetes
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5287117/
https://www.ncbi.nlm.nih.gov/pubmed/28154504
http://dx.doi.org/10.5808/GI.2016.14.4.138
work_keys_str_mv AT choisungkyoung riskpredictionusinggenomewideassociationstudiesontype2diabetes
AT baesunghwan riskpredictionusinggenomewideassociationstudiesontype2diabetes
AT parktaesung riskpredictionusinggenomewideassociationstudiesontype2diabetes