Cargando…

The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics

In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for pre...

Descripción completa

Detalles Bibliográficos
Autores principales: de Vlaming, Ronald, Groenen, Patrick J. F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4529984/
https://www.ncbi.nlm.nih.gov/pubmed/26273586
http://dx.doi.org/10.1155/2015/143712
_version_ 1782384853789966336
author de Vlaming, Ronald
Groenen, Patrick J. F.
author_facet de Vlaming, Ronald
Groenen, Patrick J. F.
author_sort de Vlaming, Ronald
collection PubMed
description In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially.
format Online
Article
Text
id pubmed-4529984
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-45299842015-08-13 The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics de Vlaming, Ronald Groenen, Patrick J. F. Biomed Res Int Review Article In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially. Hindawi Publishing Corporation 2015 2015-07-26 /pmc/articles/PMC4529984/ /pubmed/26273586 http://dx.doi.org/10.1155/2015/143712 Text en Copyright © 2015 R. de Vlaming and P. J. F. Groenen. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
de Vlaming, Ronald
Groenen, Patrick J. F.
The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title_full The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title_fullStr The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title_full_unstemmed The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title_short The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
title_sort current and future use of ridge regression for prediction in quantitative genetics
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4529984/
https://www.ncbi.nlm.nih.gov/pubmed/26273586
http://dx.doi.org/10.1155/2015/143712
work_keys_str_mv AT devlamingronald thecurrentandfutureuseofridgeregressionforpredictioninquantitativegenetics
AT groenenpatrickjf thecurrentandfutureuseofridgeregressionforpredictioninquantitativegenetics
AT devlamingronald currentandfutureuseofridgeregressionforpredictioninquantitativegenetics
AT groenenpatrickjf currentandfutureuseofridgeregressionforpredictioninquantitativegenetics