Cargando…
Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals
Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428155/ https://www.ncbi.nlm.nih.gov/pubmed/25600682 http://dx.doi.org/10.1111/ahg.12099 |
_version_ | 1782370848033734656 |
---|---|
author | Berger, Swetlana Pérez‐Rodríguez, Paulino Veturi, Yogasudha Simianer, Henner de los Campos, Gustavo |
author_facet | Berger, Swetlana Pérez‐Rodríguez, Paulino Veturi, Yogasudha Simianer, Henner de los Campos, Gustavo |
author_sort | Berger, Swetlana |
collection | PubMed |
description | Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome regression (WGR) methods. However, there has been limited research on the factors that affect prediction accuracy (PA) of WGRs when applied to human data of distantly related individuals. Here, we examine, using real human genotypes and simulated phenotypes, how trait complexity, marker‐quantitative trait loci (QTL) linkage disequilibrium (LD), and the model used affect the performance of WGRs. Our results indicated that the estimated rate of missing heritability is dependent on the extent of marker‐QTL LD. However, this parameter was not greatly affected by trait complexity. Regarding PA our results indicated that: (a) under perfect marker‐QTL LD WGR can achieve moderately high prediction accuracy, and with simple genetic architectures variable selection methods outperform shrinkage procedures and (b) under imperfect marker‐QTL LD, variable selection methods can achieved reasonably good PA with simple or moderately complex genetic architectures; however, the PA of these methods deteriorated as trait complexity increases and with highly complex traits variable selection and shrinkage methods both performed poorly. This was confirmed with an analysis of human height. |
format | Online Article Text |
id | pubmed-4428155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-44281552015-05-12 Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals Berger, Swetlana Pérez‐Rodríguez, Paulino Veturi, Yogasudha Simianer, Henner de los Campos, Gustavo Ann Hum Genet Original Articles Genome‐wide association studies (GWAS) have detected large numbers of variants associated with complex human traits and diseases. However, the proportion of variance explained by GWAS‐significant single nucleotide polymorphisms has been usually small. This brought interest in the use of whole‐genome regression (WGR) methods. However, there has been limited research on the factors that affect prediction accuracy (PA) of WGRs when applied to human data of distantly related individuals. Here, we examine, using real human genotypes and simulated phenotypes, how trait complexity, marker‐quantitative trait loci (QTL) linkage disequilibrium (LD), and the model used affect the performance of WGRs. Our results indicated that the estimated rate of missing heritability is dependent on the extent of marker‐QTL LD. However, this parameter was not greatly affected by trait complexity. Regarding PA our results indicated that: (a) under perfect marker‐QTL LD WGR can achieve moderately high prediction accuracy, and with simple genetic architectures variable selection methods outperform shrinkage procedures and (b) under imperfect marker‐QTL LD, variable selection methods can achieved reasonably good PA with simple or moderately complex genetic architectures; however, the PA of these methods deteriorated as trait complexity increases and with highly complex traits variable selection and shrinkage methods both performed poorly. This was confirmed with an analysis of human height. John Wiley and Sons Inc. 2015-01-20 2015-03 /pmc/articles/PMC4428155/ /pubmed/25600682 http://dx.doi.org/10.1111/ahg.12099 Text en © 2015 The Authors. Annals of Human Genetics published by University College London (UCL) and John Wiley & Sons Ltd This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Articles Berger, Swetlana Pérez‐Rodríguez, Paulino Veturi, Yogasudha Simianer, Henner de los Campos, Gustavo Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title | Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title_full | Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title_fullStr | Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title_full_unstemmed | Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title_short | Effectiveness of Shrinkage and Variable Selection Methods for the Prediction of Complex Human Traits using Data from Distantly Related Individuals |
title_sort | effectiveness of shrinkage and variable selection methods for the prediction of complex human traits using data from distantly related individuals |
topic | Original Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4428155/ https://www.ncbi.nlm.nih.gov/pubmed/25600682 http://dx.doi.org/10.1111/ahg.12099 |
work_keys_str_mv | AT bergerswetlana effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals AT perezrodriguezpaulino effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals AT veturiyogasudha effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals AT simianerhenner effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals AT deloscamposgustavo effectivenessofshrinkageandvariableselectionmethodsforthepredictionofcomplexhumantraitsusingdatafromdistantlyrelatedindividuals |