Cargando…
The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics
In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for pre...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4529984/ https://www.ncbi.nlm.nih.gov/pubmed/26273586 http://dx.doi.org/10.1155/2015/143712 |
_version_ | 1782384853789966336 |
---|---|
author | de Vlaming, Ronald Groenen, Patrick J. F. |
author_facet | de Vlaming, Ronald Groenen, Patrick J. F. |
author_sort | de Vlaming, Ronald |
collection | PubMed |
description | In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially. |
format | Online Article Text |
id | pubmed-4529984 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-45299842015-08-13 The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics de Vlaming, Ronald Groenen, Patrick J. F. Biomed Res Int Review Article In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to commonly used methods in animal breeding, (iii) the computational feasibility, and (iv) the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis). Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N < 10,000) the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP). However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially. Hindawi Publishing Corporation 2015 2015-07-26 /pmc/articles/PMC4529984/ /pubmed/26273586 http://dx.doi.org/10.1155/2015/143712 Text en Copyright © 2015 R. de Vlaming and P. J. F. Groenen. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article de Vlaming, Ronald Groenen, Patrick J. F. The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title | The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title_full | The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title_fullStr | The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title_full_unstemmed | The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title_short | The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics |
title_sort | current and future use of ridge regression for prediction in quantitative genetics |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4529984/ https://www.ncbi.nlm.nih.gov/pubmed/26273586 http://dx.doi.org/10.1155/2015/143712 |
work_keys_str_mv | AT devlamingronald thecurrentandfutureuseofridgeregressionforpredictioninquantitativegenetics AT groenenpatrickjf thecurrentandfutureuseofridgeregressionforpredictioninquantitativegenetics AT devlamingronald currentandfutureuseofridgeregressionforpredictioninquantitativegenetics AT groenenpatrickjf currentandfutureuseofridgeregressionforpredictioninquantitativegenetics |