Cargando…

Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation

In genomic selection (GS), all the markers across the entire genome are used to conduct marker-assisted selection such that each quantitative trait locus of complex trait is in linkage disequilibrium with at least one marker. Although GS improves estimated breeding values and genetic gain, in most G...

Descripción completa

Detalles Bibliográficos
Autor principal: Jia, Zhenyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5651917/
https://www.ncbi.nlm.nih.gov/pubmed/29057969
http://dx.doi.org/10.1038/s41598-017-14070-z
_version_ 1783272975813836800
author Jia, Zhenyu
author_facet Jia, Zhenyu
author_sort Jia, Zhenyu
collection PubMed
description In genomic selection (GS), all the markers across the entire genome are used to conduct marker-assisted selection such that each quantitative trait locus of complex trait is in linkage disequilibrium with at least one marker. Although GS improves estimated breeding values and genetic gain, in most GS models genetic variance is estimated from training samples with many trait-irrelevant markers, which leads to severe overfitting in the calculation of trait heritability. In this study, we demonstrated overfitting heritability due to the inclusion of trait-irrelevant markers using a series of simulations, and such overfitting can be effectively controlled by cross validation experiment. In the proposed method, the genetic variance is simply the variance of the genetic values predicted through cross validation, the residual variance is the variance of the differences between the observed phenotypic values and the predicted genetic values, and these two resultant variance components are used for calculating the unbiased heritability. We also demonstrated that the heritability calculated through cross validation is equivalent to trait predictability, which objectively reflects the applicability of the GS models. The proposed method can be implemented with the Mixed Procedure in SAS or with our R package “GSMX” which is publically available at https://cran.r-project.org/web/packages/GSMX/index.html.
format Online
Article
Text
id pubmed-5651917
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-56519172017-10-26 Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation Jia, Zhenyu Sci Rep Article In genomic selection (GS), all the markers across the entire genome are used to conduct marker-assisted selection such that each quantitative trait locus of complex trait is in linkage disequilibrium with at least one marker. Although GS improves estimated breeding values and genetic gain, in most GS models genetic variance is estimated from training samples with many trait-irrelevant markers, which leads to severe overfitting in the calculation of trait heritability. In this study, we demonstrated overfitting heritability due to the inclusion of trait-irrelevant markers using a series of simulations, and such overfitting can be effectively controlled by cross validation experiment. In the proposed method, the genetic variance is simply the variance of the genetic values predicted through cross validation, the residual variance is the variance of the differences between the observed phenotypic values and the predicted genetic values, and these two resultant variance components are used for calculating the unbiased heritability. We also demonstrated that the heritability calculated through cross validation is equivalent to trait predictability, which objectively reflects the applicability of the GS models. The proposed method can be implemented with the Mixed Procedure in SAS or with our R package “GSMX” which is publically available at https://cran.r-project.org/web/packages/GSMX/index.html. Nature Publishing Group UK 2017-10-20 /pmc/articles/PMC5651917/ /pubmed/29057969 http://dx.doi.org/10.1038/s41598-017-14070-z Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Jia, Zhenyu
Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title_full Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title_fullStr Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title_full_unstemmed Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title_short Controlling the Overfitting of Heritability in Genomic Selection through Cross Validation
title_sort controlling the overfitting of heritability in genomic selection through cross validation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5651917/
https://www.ncbi.nlm.nih.gov/pubmed/29057969
http://dx.doi.org/10.1038/s41598-017-14070-z
work_keys_str_mv AT jiazhenyu controllingtheoverfittingofheritabilityingenomicselectionthroughcrossvalidation