Cargando…

Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies

BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation...

Descripción completa

Detalles Bibliográficos
Autores principales: van den Berg, Sanne, Vandenplas, Jérémie, van Eeuwijk, Fred A., Bouwman, Aniek C., Lopes, Marcos S., Veerkamp, Roel F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6346588/
https://www.ncbi.nlm.nih.gov/pubmed/30678638
http://dx.doi.org/10.1186/s12711-019-0445-y
_version_ 1783389784645828608
author van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Bouwman, Aniek C.
Lopes, Marcos S.
Veerkamp, Roel F.
author_facet van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Bouwman, Aniek C.
Lopes, Marcos S.
Veerkamp, Roel F.
author_sort van den Berg, Sanne
collection PubMed
description BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R(2)) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R(2) of 0.39 (0.49). After quality control, 2.5 × 10(6) (3.5 × 10(6)) SNPs had a Beagle R(2) higher than 0.6, resulting in an average Beagle R(2) of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0445-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6346588
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-63465882019-01-29 Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Bouwman, Aniek C. Lopes, Marcos S. Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R(2)) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R(2) of 0.39 (0.49). After quality control, 2.5 × 10(6) (3.5 × 10(6)) SNPs had a Beagle R(2) higher than 0.6, resulting in an average Beagle R(2) of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0445-y) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-24 /pmc/articles/PMC6346588/ /pubmed/30678638 http://dx.doi.org/10.1186/s12711-019-0445-y Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
van den Berg, Sanne
Vandenplas, Jérémie
van Eeuwijk, Fred A.
Bouwman, Aniek C.
Lopes, Marcos S.
Veerkamp, Roel F.
Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title_full Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title_fullStr Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title_full_unstemmed Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title_short Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
title_sort imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6346588/
https://www.ncbi.nlm.nih.gov/pubmed/30678638
http://dx.doi.org/10.1186/s12711-019-0445-y
work_keys_str_mv AT vandenbergsanne imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies
AT vandenplasjeremie imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies
AT vaneeuwijkfreda imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies
AT bouwmananiekc imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies
AT lopesmarcoss imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies
AT veerkamproelf imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies