Cargando…

Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs

BACKGROUND: For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP...

Descripción completa

Detalles Bibliográficos
Autores principales: Song, Hailiang, Ye, Shaopan, Jiang, Yifan, Zhang, Zhe, Zhang, Qin, Ding, Xiangdong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805481/
https://www.ncbi.nlm.nih.gov/pubmed/31638889
http://dx.doi.org/10.1186/s12711-019-0500-8
_version_ 1783461395062325248
author Song, Hailiang
Ye, Shaopan
Jiang, Yifan
Zhang, Zhe
Zhang, Qin
Ding, Xiangdong
author_facet Song, Hailiang
Ye, Shaopan
Jiang, Yifan
Zhang, Zhe
Zhang, Qin
Ding, Xiangdong
author_sort Song, Hailiang
collection PubMed
description BACKGROUND: For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs. RESULTS: In total, 2089 and 2043 individuals with production and reproduction phenotypes, respectively, from three Yorkshire populations with different genetic backgrounds were genotyped with the PorcineSNP80 panel. Imputation accuracy from 80K to WGS variants reached 92%. The results showed that use of the WGS data compared to the 80K SNP panel did not increase the accuracy of genomic prediction in a single population, but using WGS data with LD pruning and GFBLUP with prior information did yield higher accuracy than the 80K SNP panel. For the 80K SNP panel genotypes, using the combined population resulted in a slight improvement, no change, or even a slight decrease in accuracy in comparison with the single population for GBLUP and ssGBLUP, while accuracy increased by 1 to 2.4% when using WGS data. Notably, the GFBLUP method did not perform well for both the combined population and the single populations. CONCLUSIONS: The use of WGS data was beneficial for combined population genomic prediction. Simply increasing the number of SNPs to the WGS level did not increase accuracy for a single population, while using pruned WGS data based on LD and GFBLUP with prior information could yield higher accuracy than the 80K SNP panel.
format Online
Article
Text
id pubmed-6805481
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68054812019-10-24 Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs Song, Hailiang Ye, Shaopan Jiang, Yifan Zhang, Zhe Zhang, Qin Ding, Xiangdong Genet Sel Evol Research Article BACKGROUND: For genomic selection in populations with a small reference population, combining populations of the same breed or populations of related breeds is an effective way to increase the size of the reference population. However, genomic predictions based on single nucleotide polymorphism (SNP)-chip genotype data using combined populations with different genetic backgrounds or from different breeds have not shown a clear advantage over using within-population or within-breed predictions. The increasing availability of whole-genome sequencing (WGS) data provides new opportunities for combined population genomic prediction. Our objective was to investigate the accuracy of genomic prediction using imputation-based WGS data from combined populations in pigs. Using 80K SNP panel genotypes, WGS genotypes, or genotypes on WGS variants that were pruned based on linkage disequilibrium (LD), three methods [genomic best linear unbiased prediction (GBLUP), single-step (ss)GBLUP, and genomic feature (GF)BLUP] were implemented with different prior information to identify the best method to improve the accuracy of genomic prediction for combined populations in pigs. RESULTS: In total, 2089 and 2043 individuals with production and reproduction phenotypes, respectively, from three Yorkshire populations with different genetic backgrounds were genotyped with the PorcineSNP80 panel. Imputation accuracy from 80K to WGS variants reached 92%. The results showed that use of the WGS data compared to the 80K SNP panel did not increase the accuracy of genomic prediction in a single population, but using WGS data with LD pruning and GFBLUP with prior information did yield higher accuracy than the 80K SNP panel. For the 80K SNP panel genotypes, using the combined population resulted in a slight improvement, no change, or even a slight decrease in accuracy in comparison with the single population for GBLUP and ssGBLUP, while accuracy increased by 1 to 2.4% when using WGS data. Notably, the GFBLUP method did not perform well for both the combined population and the single populations. CONCLUSIONS: The use of WGS data was beneficial for combined population genomic prediction. Simply increasing the number of SNPs to the WGS level did not increase accuracy for a single population, while using pruned WGS data based on LD and GFBLUP with prior information could yield higher accuracy than the 80K SNP panel. BioMed Central 2019-10-21 /pmc/articles/PMC6805481/ /pubmed/31638889 http://dx.doi.org/10.1186/s12711-019-0500-8 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Song, Hailiang
Ye, Shaopan
Jiang, Yifan
Zhang, Zhe
Zhang, Qin
Ding, Xiangdong
Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title_full Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title_fullStr Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title_full_unstemmed Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title_short Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
title_sort using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6805481/
https://www.ncbi.nlm.nih.gov/pubmed/31638889
http://dx.doi.org/10.1186/s12711-019-0500-8
work_keys_str_mv AT songhailiang usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs
AT yeshaopan usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs
AT jiangyifan usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs
AT zhangzhe usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs
AT zhangqin usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs
AT dingxiangdong usingimputationbasedwholegenomesequencingdatatoimprovetheaccuracyofgenomicpredictionforcombinedpopulationsinpigs