Cargando…

Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat

The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by p...

Descripción completa

Detalles Bibliográficos
Autores principales: Sallam, Ahmad H., Conley, Emily, Prakapenka, Dzianis, Da, Yang, Anderson, James A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341132/
https://www.ncbi.nlm.nih.gov/pubmed/32371453
http://dx.doi.org/10.1534/g3.120.401165
_version_ 1783555169869365248
author Sallam, Ahmad H.
Conley, Emily
Prakapenka, Dzianis
Da, Yang
Anderson, James A.
author_facet Sallam, Ahmad H.
Conley, Emily
Prakapenka, Dzianis
Da, Yang
Anderson, James A.
author_sort Sallam, Ahmad H.
collection PubMed
description The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops.
format Online
Article
Text
id pubmed-7341132
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-73411322020-07-21 Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat Sallam, Ahmad H. Conley, Emily Prakapenka, Dzianis Da, Yang Anderson, James A. G3 (Bethesda) Genomic Prediction The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops. Genetics Society of America 2020-05-05 /pmc/articles/PMC7341132/ /pubmed/32371453 http://dx.doi.org/10.1534/g3.120.401165 Text en Copyright © 2020 Sallam et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomic Prediction
Sallam, Ahmad H.
Conley, Emily
Prakapenka, Dzianis
Da, Yang
Anderson, James A.
Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title_full Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title_fullStr Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title_full_unstemmed Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title_short Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
title_sort improving prediction accuracy using multi-allelic haplotype prediction and training population optimization in wheat
topic Genomic Prediction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341132/
https://www.ncbi.nlm.nih.gov/pubmed/32371453
http://dx.doi.org/10.1534/g3.120.401165
work_keys_str_mv AT sallamahmadh improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat
AT conleyemily improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat
AT prakapenkadzianis improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat
AT dayang improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat
AT andersonjamesa improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat