Cargando…

How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding

Over the last two decades, the application of genomic selection has been extensively studied in various crop species, and it has become a common practice to report prediction accuracies using cross validation. However, genomic prediction accuracies obtained from random cross validation can be strong...

Descripción completa

Detalles Bibliográficos
Autores principales: Werner, Christian R., Gaynor, R. Chris, Gorjanc, Gregor, Hickey, John M., Kox, Tobias, Abbadi, Amine, Leckband, Gunhild, Snowdon, Rod J., Stahl, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7772221/
https://www.ncbi.nlm.nih.gov/pubmed/33391305
http://dx.doi.org/10.3389/fpls.2020.592977
_version_ 1783629831205814272
author Werner, Christian R.
Gaynor, R. Chris
Gorjanc, Gregor
Hickey, John M.
Kox, Tobias
Abbadi, Amine
Leckband, Gunhild
Snowdon, Rod J.
Stahl, Andreas
author_facet Werner, Christian R.
Gaynor, R. Chris
Gorjanc, Gregor
Hickey, John M.
Kox, Tobias
Abbadi, Amine
Leckband, Gunhild
Snowdon, Rod J.
Stahl, Andreas
author_sort Werner, Christian R.
collection PubMed
description Over the last two decades, the application of genomic selection has been extensively studied in various crop species, and it has become a common practice to report prediction accuracies using cross validation. However, genomic prediction accuracies obtained from random cross validation can be strongly inflated due to population or family structure, a characteristic shared by many breeding populations. An understanding of the effect of population and family structure on prediction accuracy is essential for the successful application of genomic selection in plant breeding programs. The objective of this study was to make this effect and its implications for practical breeding programs comprehensible for breeders and scientists with a limited background in quantitative genetics and genomic selection theory. We, therefore, compared genomic prediction accuracies obtained from different random cross validation approaches and within-family prediction in three different prediction scenarios. We used a highly structured population of 940 Brassica napus hybrids coming from 46 testcross families and two subpopulations. Our demonstrations show how genomic prediction accuracies obtained from among-family predictions in random cross validation and within-family predictions capture different measures of prediction accuracy. While among-family prediction accuracy measures prediction accuracy of both the parent average component and the Mendelian sampling term, within-family prediction only measures how accurately the Mendelian sampling term can be predicted. With this paper we aim to foster a critical approach to different measures of genomic prediction accuracy and a careful analysis of values observed in genomic selection experiments and reported in literature.
format Online
Article
Text
id pubmed-7772221
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77722212020-12-31 How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding Werner, Christian R. Gaynor, R. Chris Gorjanc, Gregor Hickey, John M. Kox, Tobias Abbadi, Amine Leckband, Gunhild Snowdon, Rod J. Stahl, Andreas Front Plant Sci Plant Science Over the last two decades, the application of genomic selection has been extensively studied in various crop species, and it has become a common practice to report prediction accuracies using cross validation. However, genomic prediction accuracies obtained from random cross validation can be strongly inflated due to population or family structure, a characteristic shared by many breeding populations. An understanding of the effect of population and family structure on prediction accuracy is essential for the successful application of genomic selection in plant breeding programs. The objective of this study was to make this effect and its implications for practical breeding programs comprehensible for breeders and scientists with a limited background in quantitative genetics and genomic selection theory. We, therefore, compared genomic prediction accuracies obtained from different random cross validation approaches and within-family prediction in three different prediction scenarios. We used a highly structured population of 940 Brassica napus hybrids coming from 46 testcross families and two subpopulations. Our demonstrations show how genomic prediction accuracies obtained from among-family predictions in random cross validation and within-family predictions capture different measures of prediction accuracy. While among-family prediction accuracy measures prediction accuracy of both the parent average component and the Mendelian sampling term, within-family prediction only measures how accurately the Mendelian sampling term can be predicted. With this paper we aim to foster a critical approach to different measures of genomic prediction accuracy and a careful analysis of values observed in genomic selection experiments and reported in literature. Frontiers Media S.A. 2020-12-16 /pmc/articles/PMC7772221/ /pubmed/33391305 http://dx.doi.org/10.3389/fpls.2020.592977 Text en Copyright © 2020 Werner, Gaynor, Gorjanc, Hickey, Kox, Abbadi, Leckband, Snowdon and Stahl. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Werner, Christian R.
Gaynor, R. Chris
Gorjanc, Gregor
Hickey, John M.
Kox, Tobias
Abbadi, Amine
Leckband, Gunhild
Snowdon, Rod J.
Stahl, Andreas
How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title_full How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title_fullStr How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title_full_unstemmed How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title_short How Population Structure Impacts Genomic Selection Accuracy in Cross-Validation: Implications for Practical Breeding
title_sort how population structure impacts genomic selection accuracy in cross-validation: implications for practical breeding
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7772221/
https://www.ncbi.nlm.nih.gov/pubmed/33391305
http://dx.doi.org/10.3389/fpls.2020.592977
work_keys_str_mv AT wernerchristianr howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT gaynorrchris howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT gorjancgregor howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT hickeyjohnm howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT koxtobias howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT abbadiamine howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT leckbandgunhild howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT snowdonrodj howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding
AT stahlandreas howpopulationstructureimpactsgenomicselectionaccuracyincrossvalidationimplicationsforpracticalbreeding