Cargando…

Utility of whole-genome sequence data for across-breed genomic prediction

BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selec...

Descripción completa

Detalles Bibliográficos
Autores principales: Raymond, Biaty, Bouwman, Aniek C., Schrooten, Chris, Houwing-Duistermaat, Jeanine, Veerkamp, Roel F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5960108/
https://www.ncbi.nlm.nih.gov/pubmed/29776327
http://dx.doi.org/10.1186/s12711-018-0396-8
_version_ 1783324525908197376
author Raymond, Biaty
Bouwman, Aniek C.
Schrooten, Chris
Houwing-Duistermaat, Jeanine
Veerkamp, Roel F.
author_facet Raymond, Biaty
Bouwman, Aniek C.
Schrooten, Chris
Houwing-Duistermaat, Jeanine
Veerkamp, Roel F.
author_sort Raymond, Biaty
collection PubMed
description BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction. METHODS: Estimated breeding values for stature and bovine high-density (HD) genotypes were available for 595 Jersey bulls from New Zealand, 957 Holstein bulls from New Zealand and 5553 Holstein bulls from the Netherlands. BovineHD genotypes for all bulls were imputed to WGS using Beagle4 and Minimac2. Genomic prediction across the three populations was performed with ASReml4, with each population used as single reference and as single validation sets. In addition to the 50k, HD and WGS, markers that were significantly associated with stature in a large meta-GWAS analysis were selected and used for prediction, resulting in 10 prediction scenarios. Furthermore, we estimated the proportion of genetic variance captured by markers in each scenario. RESULTS: Across breeds, 50k, HD and WGS markers resulted in very low accuracies of prediction ranging from − 0.04 to 0.13. Accuracies were higher in scenarios with pre-selected markers from a meta-GWAS. For example, using only the 133 most significant markers in 133 QTL regions from the meta-GWAS yielded accuracies ranging from 0.08 to 0.23, while 23,125 markers with a − log10(p) higher than 7 resulted in accuracies of up 0.35. Using WGS data did not significantly improve the proportion of genetic variance captured across breeds compared to scenarios with few but pre-selected markers. CONCLUSIONS: Our results demonstrated that the accuracy of across-breed GP can be improved by using markers that are pre-selected from WGS based on their potential causal effect. We also showed that simply increasing the number of markers up to the WGS level does not increase the accuracy of across-breed prediction, even when markers that are expected to have a causal effect are included.
format Online
Article
Text
id pubmed-5960108
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-59601082018-05-24 Utility of whole-genome sequence data for across-breed genomic prediction Raymond, Biaty Bouwman, Aniek C. Schrooten, Chris Houwing-Duistermaat, Jeanine Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction. METHODS: Estimated breeding values for stature and bovine high-density (HD) genotypes were available for 595 Jersey bulls from New Zealand, 957 Holstein bulls from New Zealand and 5553 Holstein bulls from the Netherlands. BovineHD genotypes for all bulls were imputed to WGS using Beagle4 and Minimac2. Genomic prediction across the three populations was performed with ASReml4, with each population used as single reference and as single validation sets. In addition to the 50k, HD and WGS, markers that were significantly associated with stature in a large meta-GWAS analysis were selected and used for prediction, resulting in 10 prediction scenarios. Furthermore, we estimated the proportion of genetic variance captured by markers in each scenario. RESULTS: Across breeds, 50k, HD and WGS markers resulted in very low accuracies of prediction ranging from − 0.04 to 0.13. Accuracies were higher in scenarios with pre-selected markers from a meta-GWAS. For example, using only the 133 most significant markers in 133 QTL regions from the meta-GWAS yielded accuracies ranging from 0.08 to 0.23, while 23,125 markers with a − log10(p) higher than 7 resulted in accuracies of up 0.35. Using WGS data did not significantly improve the proportion of genetic variance captured across breeds compared to scenarios with few but pre-selected markers. CONCLUSIONS: Our results demonstrated that the accuracy of across-breed GP can be improved by using markers that are pre-selected from WGS based on their potential causal effect. We also showed that simply increasing the number of markers up to the WGS level does not increase the accuracy of across-breed prediction, even when markers that are expected to have a causal effect are included. BioMed Central 2018-05-18 /pmc/articles/PMC5960108/ /pubmed/29776327 http://dx.doi.org/10.1186/s12711-018-0396-8 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Raymond, Biaty
Bouwman, Aniek C.
Schrooten, Chris
Houwing-Duistermaat, Jeanine
Veerkamp, Roel F.
Utility of whole-genome sequence data for across-breed genomic prediction
title Utility of whole-genome sequence data for across-breed genomic prediction
title_full Utility of whole-genome sequence data for across-breed genomic prediction
title_fullStr Utility of whole-genome sequence data for across-breed genomic prediction
title_full_unstemmed Utility of whole-genome sequence data for across-breed genomic prediction
title_short Utility of whole-genome sequence data for across-breed genomic prediction
title_sort utility of whole-genome sequence data for across-breed genomic prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5960108/
https://www.ncbi.nlm.nih.gov/pubmed/29776327
http://dx.doi.org/10.1186/s12711-018-0396-8
work_keys_str_mv AT raymondbiaty utilityofwholegenomesequencedataforacrossbreedgenomicprediction
AT bouwmananiekc utilityofwholegenomesequencedataforacrossbreedgenomicprediction
AT schrootenchris utilityofwholegenomesequencedataforacrossbreedgenomicprediction
AT houwingduistermaatjeanine utilityofwholegenomesequencedataforacrossbreedgenomicprediction
AT veerkamproelf utilityofwholegenomesequencedataforacrossbreedgenomicprediction