Cargando…
Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat
The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by p...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341132/ https://www.ncbi.nlm.nih.gov/pubmed/32371453 http://dx.doi.org/10.1534/g3.120.401165 |
_version_ | 1783555169869365248 |
---|---|
author | Sallam, Ahmad H. Conley, Emily Prakapenka, Dzianis Da, Yang Anderson, James A. |
author_facet | Sallam, Ahmad H. Conley, Emily Prakapenka, Dzianis Da, Yang Anderson, James A. |
author_sort | Sallam, Ahmad H. |
collection | PubMed |
description | The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops. |
format | Online Article Text |
id | pubmed-7341132 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-73411322020-07-21 Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat Sallam, Ahmad H. Conley, Emily Prakapenka, Dzianis Da, Yang Anderson, James A. G3 (Bethesda) Genomic Prediction The use of haplotypes may improve the accuracy of genomic prediction over single SNPs because haplotypes can better capture linkage disequilibrium and genomic similarity in different lines and may capture local high-order allelic interactions. Additionally, prediction accuracy could be improved by portraying population structure in the calibration set. A set of 383 advanced lines and cultivars that represent the diversity of the University of Minnesota wheat breeding program was phenotyped for yield, test weight, and protein content and genotyped using the Illumina 90K SNP Assay. Population structure was confirmed using single SNPs. Haplotype blocks of 5, 10, 15, and 20 adjacent markers were constructed for all chromosomes. A multi-allelic haplotype prediction algorithm was implemented and compared with single SNPs using both k-fold cross validation and stratified sampling optimization. After confirming population structure, the stratified sampling improved the predictive ability compared with k-fold cross validation for yield and protein content, but reduced the predictive ability for test weight. In all cases, haplotype predictions outperformed single SNPs. Haplotypes of 15 adjacent markers showed the best improvement in accuracy for all traits; however, this was more pronounced in yield and protein content. The combined use of haplotypes of 15 adjacent markers and training population optimization significantly improved the predictive ability for yield and protein content by 14.3 (four percentage points) and 16.8% (seven percentage points), respectively, compared with using single SNPs and k-fold cross validation. These results emphasize the effectiveness of using haplotypes in genomic selection to increase genetic gain in self-fertilized crops. Genetics Society of America 2020-05-05 /pmc/articles/PMC7341132/ /pubmed/32371453 http://dx.doi.org/10.1534/g3.120.401165 Text en Copyright © 2020 Sallam et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genomic Prediction Sallam, Ahmad H. Conley, Emily Prakapenka, Dzianis Da, Yang Anderson, James A. Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title | Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title_full | Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title_fullStr | Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title_full_unstemmed | Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title_short | Improving Prediction Accuracy Using Multi-allelic Haplotype Prediction and Training Population Optimization in Wheat |
title_sort | improving prediction accuracy using multi-allelic haplotype prediction and training population optimization in wheat |
topic | Genomic Prediction |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7341132/ https://www.ncbi.nlm.nih.gov/pubmed/32371453 http://dx.doi.org/10.1534/g3.120.401165 |
work_keys_str_mv | AT sallamahmadh improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat AT conleyemily improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat AT prakapenkadzianis improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat AT dayang improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat AT andersonjamesa improvingpredictionaccuracyusingmultiallelichaplotypepredictionandtrainingpopulationoptimizationinwheat |