Cargando…

Assessment of Whole-Genome Regression for Type II Diabetes

Lifestyle and genetic factors play a large role in the development of Type 2 Diabetes (T2D). Despite the important role of genetic factors, genetic information is not incorporated into the clinical assessment of T2D risk. We assessed and compared Whole Genome Regression methods to predict the T2D st...

Descripción completa

Detalles Bibliográficos
Autores principales: Vazquez, Ana I., Klimentidis, Yann C., Dhurandhar, Emily J., Veturi, Yogasudha C., Paérez-Rodríguez, Paulino
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4401705/
https://www.ncbi.nlm.nih.gov/pubmed/25885636
http://dx.doi.org/10.1371/journal.pone.0123818
_version_ 1782367177500786688
author Vazquez, Ana I.
Klimentidis, Yann C.
Dhurandhar, Emily J.
Veturi, Yogasudha C.
Paérez-Rodríguez, Paulino
author_facet Vazquez, Ana I.
Klimentidis, Yann C.
Dhurandhar, Emily J.
Veturi, Yogasudha C.
Paérez-Rodríguez, Paulino
author_sort Vazquez, Ana I.
collection PubMed
description Lifestyle and genetic factors play a large role in the development of Type 2 Diabetes (T2D). Despite the important role of genetic factors, genetic information is not incorporated into the clinical assessment of T2D risk. We assessed and compared Whole Genome Regression methods to predict the T2D status of 5,245 subjects from the Framingham Heart Study. For evaluating each method we constructed the following set of regression models: A clinical baseline model (CBM) which included non-genetic covariates only. CBM was extended by adding the first two marker-derived principal components and 65 SNPs identified by a recent GWAS consortium for T2D (M-65SNPs). Subsequently, it was further extended by adding 249,798 genome-wide SNPs from a high-density array. The Bayesian models used to incorporate genome-wide marker information as predictors were: Bayes A, Bayes Cπ, Bayesian LASSO (BL), and the Genomic Best Linear Unbiased Prediction (G-BLUP). Results included estimates of the genetic variance and heritability, genetic scores for T2D, and predictive ability evaluated in a 10-fold cross-validation. The predictive AUC estimates for CBM and M-65SNPs were: 0.668 and 0.684, respectively. We found evidence of contribution of genetic effects in T2D, as reflected in the genomic heritability estimates (0.492±0.066). The highest predictive AUC among the genome-wide marker Bayesian models was 0.681 for the Bayesian LASSO. Overall, the improvement in predictive ability was moderate and did not differ greatly among models that included genetic information. Approximately 58% of the total number of genetic variants was found to contribute to the overall genetic variation, indicating a complex genetic architecture for T2D. Our results suggest that the Bayes Cπ and the G-BLUP models with a large set of genome-wide markers could be used for predicting risk to T2D, as an alternative to using high-density arrays when selected markers from large consortiums for a given complex trait or disease are unavailable.
format Online
Article
Text
id pubmed-4401705
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44017052015-04-21 Assessment of Whole-Genome Regression for Type II Diabetes Vazquez, Ana I. Klimentidis, Yann C. Dhurandhar, Emily J. Veturi, Yogasudha C. Paérez-Rodríguez, Paulino PLoS One Research Article Lifestyle and genetic factors play a large role in the development of Type 2 Diabetes (T2D). Despite the important role of genetic factors, genetic information is not incorporated into the clinical assessment of T2D risk. We assessed and compared Whole Genome Regression methods to predict the T2D status of 5,245 subjects from the Framingham Heart Study. For evaluating each method we constructed the following set of regression models: A clinical baseline model (CBM) which included non-genetic covariates only. CBM was extended by adding the first two marker-derived principal components and 65 SNPs identified by a recent GWAS consortium for T2D (M-65SNPs). Subsequently, it was further extended by adding 249,798 genome-wide SNPs from a high-density array. The Bayesian models used to incorporate genome-wide marker information as predictors were: Bayes A, Bayes Cπ, Bayesian LASSO (BL), and the Genomic Best Linear Unbiased Prediction (G-BLUP). Results included estimates of the genetic variance and heritability, genetic scores for T2D, and predictive ability evaluated in a 10-fold cross-validation. The predictive AUC estimates for CBM and M-65SNPs were: 0.668 and 0.684, respectively. We found evidence of contribution of genetic effects in T2D, as reflected in the genomic heritability estimates (0.492±0.066). The highest predictive AUC among the genome-wide marker Bayesian models was 0.681 for the Bayesian LASSO. Overall, the improvement in predictive ability was moderate and did not differ greatly among models that included genetic information. Approximately 58% of the total number of genetic variants was found to contribute to the overall genetic variation, indicating a complex genetic architecture for T2D. Our results suggest that the Bayes Cπ and the G-BLUP models with a large set of genome-wide markers could be used for predicting risk to T2D, as an alternative to using high-density arrays when selected markers from large consortiums for a given complex trait or disease are unavailable. Public Library of Science 2015-04-17 /pmc/articles/PMC4401705/ /pubmed/25885636 http://dx.doi.org/10.1371/journal.pone.0123818 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Vazquez, Ana I.
Klimentidis, Yann C.
Dhurandhar, Emily J.
Veturi, Yogasudha C.
Paérez-Rodríguez, Paulino
Assessment of Whole-Genome Regression for Type II Diabetes
title Assessment of Whole-Genome Regression for Type II Diabetes
title_full Assessment of Whole-Genome Regression for Type II Diabetes
title_fullStr Assessment of Whole-Genome Regression for Type II Diabetes
title_full_unstemmed Assessment of Whole-Genome Regression for Type II Diabetes
title_short Assessment of Whole-Genome Regression for Type II Diabetes
title_sort assessment of whole-genome regression for type ii diabetes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4401705/
https://www.ncbi.nlm.nih.gov/pubmed/25885636
http://dx.doi.org/10.1371/journal.pone.0123818
work_keys_str_mv AT vazquezanai assessmentofwholegenomeregressionfortypeiidiabetes
AT klimentidisyannc assessmentofwholegenomeregressionfortypeiidiabetes
AT dhurandharemilyj assessmentofwholegenomeregressionfortypeiidiabetes
AT veturiyogasudhac assessmentofwholegenomeregressionfortypeiidiabetes
AT paerezrodriguezpaulino assessmentofwholegenomeregressionfortypeiidiabetes