Cargando…
Utility of whole-genome sequence data for across-breed genomic prediction
BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selec...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5960108/ https://www.ncbi.nlm.nih.gov/pubmed/29776327 http://dx.doi.org/10.1186/s12711-018-0396-8 |
_version_ | 1783324525908197376 |
---|---|
author | Raymond, Biaty Bouwman, Aniek C. Schrooten, Chris Houwing-Duistermaat, Jeanine Veerkamp, Roel F. |
author_facet | Raymond, Biaty Bouwman, Aniek C. Schrooten, Chris Houwing-Duistermaat, Jeanine Veerkamp, Roel F. |
author_sort | Raymond, Biaty |
collection | PubMed |
description | BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction. METHODS: Estimated breeding values for stature and bovine high-density (HD) genotypes were available for 595 Jersey bulls from New Zealand, 957 Holstein bulls from New Zealand and 5553 Holstein bulls from the Netherlands. BovineHD genotypes for all bulls were imputed to WGS using Beagle4 and Minimac2. Genomic prediction across the three populations was performed with ASReml4, with each population used as single reference and as single validation sets. In addition to the 50k, HD and WGS, markers that were significantly associated with stature in a large meta-GWAS analysis were selected and used for prediction, resulting in 10 prediction scenarios. Furthermore, we estimated the proportion of genetic variance captured by markers in each scenario. RESULTS: Across breeds, 50k, HD and WGS markers resulted in very low accuracies of prediction ranging from − 0.04 to 0.13. Accuracies were higher in scenarios with pre-selected markers from a meta-GWAS. For example, using only the 133 most significant markers in 133 QTL regions from the meta-GWAS yielded accuracies ranging from 0.08 to 0.23, while 23,125 markers with a − log10(p) higher than 7 resulted in accuracies of up 0.35. Using WGS data did not significantly improve the proportion of genetic variance captured across breeds compared to scenarios with few but pre-selected markers. CONCLUSIONS: Our results demonstrated that the accuracy of across-breed GP can be improved by using markers that are pre-selected from WGS based on their potential causal effect. We also showed that simply increasing the number of markers up to the WGS level does not increase the accuracy of across-breed prediction, even when markers that are expected to have a causal effect are included. |
format | Online Article Text |
id | pubmed-5960108 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-59601082018-05-24 Utility of whole-genome sequence data for across-breed genomic prediction Raymond, Biaty Bouwman, Aniek C. Schrooten, Chris Houwing-Duistermaat, Jeanine Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: Genomic prediction (GP) across breeds has so far resulted in low accuracies of the predicted genomic breeding values. Our objective was to evaluate whether using whole-genome sequence (WGS) instead of low-density markers can improve GP across breeds, especially when markers are pre-selected from a genome-wide association study (GWAS), and to test our hypothesis that many non-causal markers in WGS data have a diluting effect on accuracy of across-breed prediction. METHODS: Estimated breeding values for stature and bovine high-density (HD) genotypes were available for 595 Jersey bulls from New Zealand, 957 Holstein bulls from New Zealand and 5553 Holstein bulls from the Netherlands. BovineHD genotypes for all bulls were imputed to WGS using Beagle4 and Minimac2. Genomic prediction across the three populations was performed with ASReml4, with each population used as single reference and as single validation sets. In addition to the 50k, HD and WGS, markers that were significantly associated with stature in a large meta-GWAS analysis were selected and used for prediction, resulting in 10 prediction scenarios. Furthermore, we estimated the proportion of genetic variance captured by markers in each scenario. RESULTS: Across breeds, 50k, HD and WGS markers resulted in very low accuracies of prediction ranging from − 0.04 to 0.13. Accuracies were higher in scenarios with pre-selected markers from a meta-GWAS. For example, using only the 133 most significant markers in 133 QTL regions from the meta-GWAS yielded accuracies ranging from 0.08 to 0.23, while 23,125 markers with a − log10(p) higher than 7 resulted in accuracies of up 0.35. Using WGS data did not significantly improve the proportion of genetic variance captured across breeds compared to scenarios with few but pre-selected markers. CONCLUSIONS: Our results demonstrated that the accuracy of across-breed GP can be improved by using markers that are pre-selected from WGS based on their potential causal effect. We also showed that simply increasing the number of markers up to the WGS level does not increase the accuracy of across-breed prediction, even when markers that are expected to have a causal effect are included. BioMed Central 2018-05-18 /pmc/articles/PMC5960108/ /pubmed/29776327 http://dx.doi.org/10.1186/s12711-018-0396-8 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Raymond, Biaty Bouwman, Aniek C. Schrooten, Chris Houwing-Duistermaat, Jeanine Veerkamp, Roel F. Utility of whole-genome sequence data for across-breed genomic prediction |
title | Utility of whole-genome sequence data for across-breed genomic prediction |
title_full | Utility of whole-genome sequence data for across-breed genomic prediction |
title_fullStr | Utility of whole-genome sequence data for across-breed genomic prediction |
title_full_unstemmed | Utility of whole-genome sequence data for across-breed genomic prediction |
title_short | Utility of whole-genome sequence data for across-breed genomic prediction |
title_sort | utility of whole-genome sequence data for across-breed genomic prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5960108/ https://www.ncbi.nlm.nih.gov/pubmed/29776327 http://dx.doi.org/10.1186/s12711-018-0396-8 |
work_keys_str_mv | AT raymondbiaty utilityofwholegenomesequencedataforacrossbreedgenomicprediction AT bouwmananiekc utilityofwholegenomesequencedataforacrossbreedgenomicprediction AT schrootenchris utilityofwholegenomesequencedataforacrossbreedgenomicprediction AT houwingduistermaatjeanine utilityofwholegenomesequencedataforacrossbreedgenomicprediction AT veerkamproelf utilityofwholegenomesequencedataforacrossbreedgenomicprediction |