Cargando…

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy

Genomic selection applied to plant breeding enables earlier estimates of a line’s performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel...

Descripción completa

Detalles Bibliográficos
Autores principales: Norman, Adam, Taylor, Julian, Edwards, James, Kuchel, Haydn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6118301/
https://www.ncbi.nlm.nih.gov/pubmed/29970398
http://dx.doi.org/10.1534/g3.118.200311
_version_ 1783351905734361088
author Norman, Adam
Taylor, Julian
Edwards, James
Kuchel, Haydn
author_facet Norman, Adam
Taylor, Julian
Edwards, James
Kuchel, Haydn
author_sort Norman, Adam
collection PubMed
description Genomic selection applied to plant breeding enables earlier estimates of a line’s performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel of 10,375 bread wheat (Triticum aestivum) lines genotyped with 18,101 SNP markers to investigate the effect and interaction of training set size, population structure and marker density on genomic prediction accuracy. Through assessing the effect of training set size we showed the rate at which prediction accuracy increases is slower beyond approximately 2,000 lines. The structure of the panel was assessed via principal component analysis and K-means clustering, and its effect on prediction accuracy was examined through a novel cross-validation analysis according to the K-means clusters and breeding cohorts. Here we showed that accuracy can be improved by increasing the diversity within the training set, particularly when relatedness between training and validation sets is low. The breeding cohort analysis revealed that traits with higher selection pressure (lower allelic diversity) can be more accurately predicted by including several previous cohorts in the training set. The effect of marker density and its interaction with population structure was assessed for marker subsets containing between 100 and 17,181 markers. This analysis showed that response to increased marker density is largest when using a diverse training set to predict between poorly related material. These findings represent a significant resource for plant breeders and contribute to the collective knowledge on the optimal structure of calibration panels for genomic prediction.
format Online
Article
Text
id pubmed-6118301
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-61183012018-09-04 Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy Norman, Adam Taylor, Julian Edwards, James Kuchel, Haydn G3 (Bethesda) Genomic Selection Genomic selection applied to plant breeding enables earlier estimates of a line’s performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel of 10,375 bread wheat (Triticum aestivum) lines genotyped with 18,101 SNP markers to investigate the effect and interaction of training set size, population structure and marker density on genomic prediction accuracy. Through assessing the effect of training set size we showed the rate at which prediction accuracy increases is slower beyond approximately 2,000 lines. The structure of the panel was assessed via principal component analysis and K-means clustering, and its effect on prediction accuracy was examined through a novel cross-validation analysis according to the K-means clusters and breeding cohorts. Here we showed that accuracy can be improved by increasing the diversity within the training set, particularly when relatedness between training and validation sets is low. The breeding cohort analysis revealed that traits with higher selection pressure (lower allelic diversity) can be more accurately predicted by including several previous cohorts in the training set. The effect of marker density and its interaction with population structure was assessed for marker subsets containing between 100 and 17,181 markers. This analysis showed that response to increased marker density is largest when using a diverse training set to predict between poorly related material. These findings represent a significant resource for plant breeders and contribute to the collective knowledge on the optimal structure of calibration panels for genomic prediction. Genetics Society of America 2018-07-03 /pmc/articles/PMC6118301/ /pubmed/29970398 http://dx.doi.org/10.1534/g3.118.200311 Text en Copyright © 2018 Norman et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Selection
Norman, Adam
Taylor, Julian
Edwards, James
Kuchel, Haydn
Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title_full Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title_fullStr Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title_full_unstemmed Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title_short Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy
title_sort optimising genomic selection in wheat: effect of marker density, population size and population structure on prediction accuracy
topic Genomic Selection
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6118301/
https://www.ncbi.nlm.nih.gov/pubmed/29970398
http://dx.doi.org/10.1534/g3.118.200311
work_keys_str_mv AT normanadam optimisinggenomicselectioninwheateffectofmarkerdensitypopulationsizeandpopulationstructureonpredictionaccuracy
AT taylorjulian optimisinggenomicselectioninwheateffectofmarkerdensitypopulationsizeandpopulationstructureonpredictionaccuracy
AT edwardsjames optimisinggenomicselectioninwheateffectofmarkerdensitypopulationsizeandpopulationstructureonpredictionaccuracy
AT kuchelhaydn optimisinggenomicselectioninwheateffectofmarkerdensitypopulationsizeandpopulationstructureonpredictionaccuracy