Cargando…

Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets

In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype d...

Descripción completa

Detalles Bibliográficos
Autores principales:	Weber, Sven E., Frisch, Matthias, Snowdon, Rod J., Voss-Fels, Kai P.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Plant Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507710/ https://www.ncbi.nlm.nih.gov/pubmed/37731980 http://dx.doi.org/10.3389/fpls.2023.1217589

_version_	1785107371856494592
author	Weber, Sven E. Frisch, Matthias Snowdon, Rod J. Voss-Fels, Kai P.
author_facet	Weber, Sven E. Frisch, Matthias Snowdon, Rod J. Voss-Fels, Kai P.
author_sort	Weber, Sven E.
collection	PubMed
description	In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software “Haploview” and “HaploBlocker”. The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no “best” method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset.
format	Online Article Text
id	pubmed-10507710
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-105077102023-09-20 Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets Weber, Sven E. Frisch, Matthias Snowdon, Rod J. Voss-Fels, Kai P. Front Plant Sci Plant Science In modern plant breeding, genomic selection is becoming the gold standard for selection of superior genotypes. The basis for genomic prediction models is a set of phenotyped lines along with their genotypic profile. With high marker density and linkage disequilibrium (LD) between markers, genotype data in breeding populations tends to exhibit considerable redundancy. Therefore, interest is growing in the use of haplotype blocks to overcome redundancy by summarizing co-inherited features. Moreover, haplotype blocks can help to capture local epistasis caused by interacting loci. Here, we compared genomic prediction methods that either used single SNPs or haplotype blocks with regards to their prediction accuracy for important traits in crop datasets. We used four published datasets from canola, maize, wheat and soybean. Different approaches to construct haplotype blocks were compared, including blocks based on LD, physical distance, number of adjacent markers and the algorithms implemented in the software “Haploview” and “HaploBlocker”. The tested prediction methods included Genomic Best Linear Unbiased Prediction (GBLUP), Extended GBLUP to account for additive by additive epistasis (EGBLUP), Bayesian LASSO and Reproducing Kernel Hilbert Space (RKHS) regression. We found improved prediction accuracy in some traits when using haplotype blocks compared to SNP-based predictions, however the magnitude of improvement was very trait- and model-specific. Especially in settings with low marker density, haplotype blocks can improve genomic prediction accuracy. In most cases, physically large haplotype blocks yielded a strong decrease in prediction accuracy. Especially when prediction accuracy varies greatly across different prediction models, prediction based on haplotype blocks can improve prediction accuracy of underperforming models. However, there is no “best” method to build haplotype blocks, since prediction accuracy varied considerably across methods and traits. Hence, criteria used to define haplotype blocks should not be viewed as fixed biological parameters, but rather as hyperparameters that need to be adjusted for every dataset. Frontiers Media S.A. 2023-09-05 /pmc/articles/PMC10507710/ /pubmed/37731980 http://dx.doi.org/10.3389/fpls.2023.1217589 Text en Copyright © 2023 Weber, Frisch, Snowdon and Voss-Fels https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Plant Science Weber, Sven E. Frisch, Matthias Snowdon, Rod J. Voss-Fels, Kai P. Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title	Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title_full	Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title_fullStr	Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title_full_unstemmed	Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title_short	Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
title_sort	haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets
topic	Plant Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507710/ https://www.ncbi.nlm.nih.gov/pubmed/37731980 http://dx.doi.org/10.3389/fpls.2023.1217589
work_keys_str_mv	AT webersvene haplotypeblocksforgenomicpredictionacomparativeevaluationinmultiplecropdatasets AT frischmatthias haplotypeblocksforgenomicpredictionacomparativeevaluationinmultiplecropdatasets AT snowdonrodj haplotypeblocksforgenomicpredictionacomparativeevaluationinmultiplecropdatasets AT vossfelskaip haplotypeblocksforgenomicpredictionacomparativeevaluationinmultiplecropdatasets

Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets

Ejemplares similares