Cargando…

Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals

Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome...

Descripción completa

Detalles Bibliográficos
Autores principales: Berger, Swetlana, Pérez‐Rodríguez, Paulino, Veturi, Yogasudha, Simianer, Henner, de los Campos, Gustavo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428155/
https://www.ncbi.nlm.nih.gov/pubmed/25600682
http://dx.doi.org/10.1111/ahg.12099
_version_ 1782370848033734656
author Berger, Swetlana
Pérez‐Rodríguez, Paulino
Veturi, Yogasudha
Simianer, Henner
de los Campos, Gustavo
author_facet Berger, Swetlana
Pérez‐Rodríguez, Paulino
Veturi, Yogasudha
Simianer, Henner
de los Campos, Gustavo
author_sort Berger, Swetlana
collection PubMed
description Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome regression (WGR) methods. However, there has been limited research on the factors that affect prediction accuracy (PA) of WGRs when applied to human data of distantly related individuals. Here, we examine, using real human genotypes and simulated phenotypes, how trait complexity, marker‐quantitative trait loci (QTL) linkage disequilibrium (LD), and the model used affect the performance of WGRs. Our results indicated that the estimated rate of missing heritability is dependent on the extent of marker‐QTL LD. However, this parameter was not greatly affected by trait complexity. Regarding PA our results indicated that: (a) under perfect marker‐QTL LD WGR can achieve moderately high prediction accuracy, and with simple genetic architectures variable selection methods outperform shrinkage procedures and (b) under imperfect marker‐QTL LD, variable selection methods can achieved reasonably good PA with simple or moderately complex genetic architectures; however, the PA of these methods deteriorated as trait complexity increases and with highly complex traits variable selection and shrinkage methods both performed poorly. This was confirmed with an analysis of human height.
format Online
Article
Text
id pubmed-4428155
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-44281552015-05-12 Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals Berger, Swetlana Pérez‐Rodríguez, Paulino Veturi, Yogasudha Simianer, Henner de los Campos, Gustavo Ann Hum Genet Original Articles Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome regression (WGR) methods. However, there has been limited research on the factors that affect prediction accuracy (PA) of WGRs when applied to human data of distantly related individuals. Here, we examine, using real human genotypes and simulated phenotypes, how trait complexity, marker‐quantitative trait loci (QTL) linkage disequilibrium (LD), and the model used affect the performance of WGRs. Our results indicated that the estimated rate of missing heritability is dependent on the extent of marker‐QTL LD. However, this parameter was not greatly affected by trait complexity. Regarding PA our results indicated that: (a) under perfect marker‐QTL LD WGR can achieve moderately high prediction accuracy, and with simple genetic architectures variable selection methods outperform shrinkage procedures and (b) under imperfect marker‐QTL LD, variable selection methods can achieved reasonably good PA with simple or moderately complex genetic architectures; however, the PA of these methods deteriorated as trait complexity increases and with highly complex traits variable selection and shrinkage methods both performed poorly. This was confirmed with an analysis of human height. John Wiley and Sons Inc. 2015-01-20 2015-03 /pmc/articles/PMC4428155/ /pubmed/25600682 http://dx.doi.org/10.1111/ahg.12099 Text en © 2015 The Authors. Annals of Human Genetics published by University College London (UCL) and John Wiley & Sons Ltd This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Articles
Berger, Swetlana
Pérez‐Rodríguez, Paulino
Veturi, Yogasudha
Simianer, Henner
de los Campos, Gustavo
Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title_full Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title_fullStr Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title_full_unstemmed Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title_short Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
title_sort effectiveness of shrinkage and variable selection methods for the prediction of complex human traits using data from distantly related individuals
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428155/
https://www.ncbi.nlm.nih.gov/pubmed/25600682
http://dx.doi.org/10.1111/ahg.12099
work_keys_str_mv AT bergerswetlana effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals
AT perezrodriguezpaulino effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals
AT veturiyogasudha effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals
AT simianerhenner effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals
AT deloscamposgustavo effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals