Cargando…

Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective

Genome-Wide Association Studies (GWAS) explain only a small fraction of heritability for most complex human phenotypes. Genomic heritability estimates the variance explained by the SNPs on the whole genome using mixed models and accounts for the many small contributions of SNPs in the explanation of...

Descripción completa

Detalles Bibliográficos
Autores principales: Frouin, Arthur, Dandine-Roulland, Claire, Pierre-Jean, Morgane, Deleuze, Jean-François, Ambroise, Christophe, Le Floch, Edith
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7672157/
https://www.ncbi.nlm.nih.gov/pubmed/33329721
http://dx.doi.org/10.3389/fgene.2020.581594
_version_ 1783611071190269952
author Frouin, Arthur
Dandine-Roulland, Claire
Pierre-Jean, Morgane
Deleuze, Jean-François
Ambroise, Christophe
Le Floch, Edith
author_facet Frouin, Arthur
Dandine-Roulland, Claire
Pierre-Jean, Morgane
Deleuze, Jean-François
Ambroise, Christophe
Le Floch, Edith
author_sort Frouin, Arthur
collection PubMed
description Genome-Wide Association Studies (GWAS) explain only a small fraction of heritability for most complex human phenotypes. Genomic heritability estimates the variance explained by the SNPs on the whole genome using mixed models and accounts for the many small contributions of SNPs in the explanation of a phenotype. This paper approaches heritability from a machine learning perspective, and examines the close link between mixed models and ridge regression. Our contribution is two-fold. First, we propose estimating genomic heritability using a predictive approach via ridge regression and Generalized Cross Validation (GCV). We show that this is consistent with classical mixed model based estimation. Second, we derive simple formulae that express prediction accuracy as a function of the ratio [Formula: see text] , where n is the population size and p the total number of SNPs. These formulae clearly show that a high heritability does not imply an accurate prediction when p > n. Both the estimation of heritability via GCV and the prediction accuracy formulae are validated using simulated data and real data from UK Biobank.
format Online
Article
Text
id pubmed-7672157
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76721572020-12-15 Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective Frouin, Arthur Dandine-Roulland, Claire Pierre-Jean, Morgane Deleuze, Jean-François Ambroise, Christophe Le Floch, Edith Front Genet Genetics Genome-Wide Association Studies (GWAS) explain only a small fraction of heritability for most complex human phenotypes. Genomic heritability estimates the variance explained by the SNPs on the whole genome using mixed models and accounts for the many small contributions of SNPs in the explanation of a phenotype. This paper approaches heritability from a machine learning perspective, and examines the close link between mixed models and ridge regression. Our contribution is two-fold. First, we propose estimating genomic heritability using a predictive approach via ridge regression and Generalized Cross Validation (GCV). We show that this is consistent with classical mixed model based estimation. Second, we derive simple formulae that express prediction accuracy as a function of the ratio [Formula: see text] , where n is the population size and p the total number of SNPs. These formulae clearly show that a high heritability does not imply an accurate prediction when p > n. Both the estimation of heritability via GCV and the prediction accuracy formulae are validated using simulated data and real data from UK Biobank. Frontiers Media S.A. 2020-11-04 /pmc/articles/PMC7672157/ /pubmed/33329721 http://dx.doi.org/10.3389/fgene.2020.581594 Text en Copyright © 2020 Frouin, Dandine-Roulland, Pierre-Jean, Deleuze, Ambroise and Le Floch. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Frouin, Arthur
Dandine-Roulland, Claire
Pierre-Jean, Morgane
Deleuze, Jean-François
Ambroise, Christophe
Le Floch, Edith
Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title_full Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title_fullStr Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title_full_unstemmed Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title_short Exploring the Link Between Additive Heritability and Prediction Accuracy From a Ridge Regression Perspective
title_sort exploring the link between additive heritability and prediction accuracy from a ridge regression perspective
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7672157/
https://www.ncbi.nlm.nih.gov/pubmed/33329721
http://dx.doi.org/10.3389/fgene.2020.581594
work_keys_str_mv AT frouinarthur exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective
AT dandineroullandclaire exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective
AT pierrejeanmorgane exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective
AT deleuzejeanfrancois exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective
AT ambroisechristophe exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective
AT leflochedith exploringthelinkbetweenadditiveheritabilityandpredictionaccuracyfromaridgeregressionperspective