Cargando…

Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction

Cross-validation of methods is an essential component of genome-enabled prediction of complex traits. We develop formulae for computing the predictions that would be obtained when one or several cases are removed in the training process, to become members of testing sets, but by running the model us...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gianola, Daniel, Schön, Chris-Carolin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Genetics Society of America 2016
Materias:	Genomic Selection
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5068934/ https://www.ncbi.nlm.nih.gov/pubmed/27489209 http://dx.doi.org/10.1534/g3.116.033381

_version_	1782460866392752128
author	Gianola, Daniel Schön, Chris-Carolin
author_facet	Gianola, Daniel Schön, Chris-Carolin
author_sort	Gianola, Daniel
collection	PubMed
description	Cross-validation of methods is an essential component of genome-enabled prediction of complex traits. We develop formulae for computing the predictions that would be obtained when one or several cases are removed in the training process, to become members of testing sets, but by running the model using all observations only once. Prediction methods to which the developments apply include least squares, best linear unbiased prediction (BLUP) of markers, or genomic BLUP, reproducing kernels Hilbert spaces regression with single or multiple kernel matrices, and any member of a suite of linear regression methods known as “Bayesian alphabet.” The approach used for Bayesian models is based on importance sampling of posterior draws. Proof of concept is provided by applying the formulae to a wheat data set representing 599 inbred lines genotyped for 1279 markers, and the target trait was grain yield. The data set was used to evaluate predictive mean-squared error, impact of alternative layouts on maximum likelihood estimates of regularization parameters, model complexity, and residual degrees of freedom stemming from various strengths of regularization, as well as two forms of importance sampling. Our results will facilitate carrying out extensive cross-validation without model retraining for most machines employed in genome-assisted prediction of quantitative traits.
format	Online Article Text
id	pubmed-5068934
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Genetics Society of America
record_format	MEDLINE/PubMed
spelling	pubmed-50689342016-10-24 Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction Gianola, Daniel Schön, Chris-Carolin G3 (Bethesda) Genomic Selection Cross-validation of methods is an essential component of genome-enabled prediction of complex traits. We develop formulae for computing the predictions that would be obtained when one or several cases are removed in the training process, to become members of testing sets, but by running the model using all observations only once. Prediction methods to which the developments apply include least squares, best linear unbiased prediction (BLUP) of markers, or genomic BLUP, reproducing kernels Hilbert spaces regression with single or multiple kernel matrices, and any member of a suite of linear regression methods known as “Bayesian alphabet.” The approach used for Bayesian models is based on importance sampling of posterior draws. Proof of concept is provided by applying the formulae to a wheat data set representing 599 inbred lines genotyped for 1279 markers, and the target trait was grain yield. The data set was used to evaluate predictive mean-squared error, impact of alternative layouts on maximum likelihood estimates of regularization parameters, model complexity, and residual degrees of freedom stemming from various strengths of regularization, as well as two forms of importance sampling. Our results will facilitate carrying out extensive cross-validation without model retraining for most machines employed in genome-assisted prediction of quantitative traits. Genetics Society of America 2016-08-03 /pmc/articles/PMC5068934/ /pubmed/27489209 http://dx.doi.org/10.1534/g3.116.033381 Text en Copyright © 2016 Gianola and Schon http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Genomic Selection Gianola, Daniel Schön, Chris-Carolin Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title	Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title_full	Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title_fullStr	Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title_full_unstemmed	Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title_short	Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction
title_sort	cross-validation without doing cross-validation in genome-enabled prediction
topic	Genomic Selection
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5068934/ https://www.ncbi.nlm.nih.gov/pubmed/27489209 http://dx.doi.org/10.1534/g3.116.033381
work_keys_str_mv	AT gianoladaniel crossvalidationwithoutdoingcrossvalidationingenomeenabledprediction AT schonchriscarolin crossvalidationwithoutdoingcrossvalidationingenomeenabledprediction

Cross-Validation Without Doing Cross-Validation in Genome-Enabled Prediction

Ejemplares similares