Cargando…

A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice

One objective of this study was to provide readers with a clear and unified understanding of parametric statistical and kernel methods, used for genomic prediction, and to compare some of these in the context of rice breeding for quantitative traits. Furthermore, another objective was to provide a s...

Descripción completa

Detalles Bibliográficos
Autores principales: Jacquin, Laval, Cao, Tuong-Vi, Ahmadi, Nourollah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977290/
https://www.ncbi.nlm.nih.gov/pubmed/27555865
http://dx.doi.org/10.3389/fgene.2016.00145
_version_ 1782446999282384896
author Jacquin, Laval
Cao, Tuong-Vi
Ahmadi, Nourollah
author_facet Jacquin, Laval
Cao, Tuong-Vi
Ahmadi, Nourollah
author_sort Jacquin, Laval
collection PubMed
description One objective of this study was to provide readers with a clear and unified understanding of parametric statistical and kernel methods, used for genomic prediction, and to compare some of these in the context of rice breeding for quantitative traits. Furthermore, another objective was to provide a simple and user-friendly R package, named KRMM, which allows users to perform RKHS regression with several kernels. After introducing the concept of regularized empirical risk minimization, the connections between well-known parametric and kernel methods such as Ridge regression [i.e., genomic best linear unbiased predictor (GBLUP)] and reproducing kernel Hilbert space (RKHS) regression were reviewed. Ridge regression was then reformulated so as to show and emphasize the advantage of the kernel “trick” concept, exploited by kernel methods in the context of epistatic genetic architectures, over parametric frameworks used by conventional methods. Some parametric and kernel methods; least absolute shrinkage and selection operator (LASSO), GBLUP, support vector machine regression (SVR) and RKHS regression were thereupon compared for their genomic predictive ability in the context of rice breeding using three real data sets. Among the compared methods, RKHS regression and SVR were often the most accurate methods for prediction followed by GBLUP and LASSO. An R function which allows users to perform RR-BLUP of marker effects, GBLUP and RKHS regression, with a Gaussian, Laplacian, polynomial or ANOVA kernel, in a reasonable computation time has been developed. Moreover, a modified version of this function, which allows users to tune kernels for RKHS regression, has also been developed and parallelized for HPC Linux clusters. The corresponding KRMM package and all scripts have been made publicly available.
format Online
Article
Text
id pubmed-4977290
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-49772902016-08-23 A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice Jacquin, Laval Cao, Tuong-Vi Ahmadi, Nourollah Front Genet Genetics One objective of this study was to provide readers with a clear and unified understanding of parametric statistical and kernel methods, used for genomic prediction, and to compare some of these in the context of rice breeding for quantitative traits. Furthermore, another objective was to provide a simple and user-friendly R package, named KRMM, which allows users to perform RKHS regression with several kernels. After introducing the concept of regularized empirical risk minimization, the connections between well-known parametric and kernel methods such as Ridge regression [i.e., genomic best linear unbiased predictor (GBLUP)] and reproducing kernel Hilbert space (RKHS) regression were reviewed. Ridge regression was then reformulated so as to show and emphasize the advantage of the kernel “trick” concept, exploited by kernel methods in the context of epistatic genetic architectures, over parametric frameworks used by conventional methods. Some parametric and kernel methods; least absolute shrinkage and selection operator (LASSO), GBLUP, support vector machine regression (SVR) and RKHS regression were thereupon compared for their genomic predictive ability in the context of rice breeding using three real data sets. Among the compared methods, RKHS regression and SVR were often the most accurate methods for prediction followed by GBLUP and LASSO. An R function which allows users to perform RR-BLUP of marker effects, GBLUP and RKHS regression, with a Gaussian, Laplacian, polynomial or ANOVA kernel, in a reasonable computation time has been developed. Moreover, a modified version of this function, which allows users to tune kernels for RKHS regression, has also been developed and parallelized for HPC Linux clusters. The corresponding KRMM package and all scripts have been made publicly available. Frontiers Media S.A. 2016-08-09 /pmc/articles/PMC4977290/ /pubmed/27555865 http://dx.doi.org/10.3389/fgene.2016.00145 Text en Copyright © 2016 Jacquin, Cao and Ahmadi. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Jacquin, Laval
Cao, Tuong-Vi
Ahmadi, Nourollah
A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title_full A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title_fullStr A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title_full_unstemmed A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title_short A Unified and Comprehensible View of Parametric and Kernel Methods for Genomic Prediction with Application to Rice
title_sort unified and comprehensible view of parametric and kernel methods for genomic prediction with application to rice
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977290/
https://www.ncbi.nlm.nih.gov/pubmed/27555865
http://dx.doi.org/10.3389/fgene.2016.00145
work_keys_str_mv AT jacquinlaval aunifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice
AT caotuongvi aunifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice
AT ahmadinourollah aunifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice
AT jacquinlaval unifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice
AT caotuongvi unifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice
AT ahmadinourollah unifiedandcomprehensibleviewofparametricandkernelmethodsforgenomicpredictionwithapplicationtorice