Cargando…
Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies
BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6346588/ https://www.ncbi.nlm.nih.gov/pubmed/30678638 http://dx.doi.org/10.1186/s12711-019-0445-y |
_version_ | 1783389784645828608 |
---|---|
author | van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Bouwman, Aniek C. Lopes, Marcos S. Veerkamp, Roel F. |
author_facet | van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Bouwman, Aniek C. Lopes, Marcos S. Veerkamp, Roel F. |
author_sort | van den Berg, Sanne |
collection | PubMed |
description | BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R(2)) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R(2) of 0.39 (0.49). After quality control, 2.5 × 10(6) (3.5 × 10(6)) SNPs had a Beagle R(2) higher than 0.6, resulting in an average Beagle R(2) of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0445-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6346588 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63465882019-01-29 Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Bouwman, Aniek C. Lopes, Marcos S. Veerkamp, Roel F. Genet Sel Evol Research Article BACKGROUND: Use of whole-genome sequence data (WGS) is expected to improve identification of quantitative trait loci (QTL). However, this requires imputation to WGS, often with a limited number of sequenced animals for the target population. The objective of this study was to investigate imputation to WGS in two pig lines using a multi-line reference population and, subsequently, to investigate the effect of using these imputed WGS (iWGS) for GWAS. METHODS: Phenotypes and genotypes were available on 12,184 Large White pigs (LW-line) and 4943 Dutch Landrace pigs (DL-line). Imputed 660 K and 80 K genotypes for the LW-line and DL-line, respectively, were imputed to iWGS using Beagle v.4.1. Since only 32 LW-line and 12 DL-line boars were sequenced, 142 animals from eight commercial lines were added. GWAS were performed for each line using the 80 K and 660 K SNPs, the genotype scores of iWGS SNPs that had an imputation accuracy (Beagle R(2)) higher than 0.6, and the dosage scores of all iWGS SNPs. RESULTS: For the DL-line (LW-line), imputation of 80 K genotypes to iWGS resulted in an average Beagle R(2) of 0.39 (0.49). After quality control, 2.5 × 10(6) (3.5 × 10(6)) SNPs had a Beagle R(2) higher than 0.6, resulting in an average Beagle R(2) of 0.83 (0.93). Compared to the 80 K and 660 K genotypes, using iWGS led to the identification of 48.9 and 64.4% more QTL regions, for the DL-line and LW-line, respectively, and the most significant SNPs in the QTL regions explained a higher proportion of phenotypic variance. Using dosage instead of genotype scores improved the identification of QTL, because the model accounted for uncertainty of imputation, and all SNPs were used in the analysis. CONCLUSIONS: Imputation to WGS using the multi-line reference population resulted in relatively poor imputation, especially when imputing from 80 K (DL-line). In spite of the poor imputation accuracies, using iWGS instead of a lower density SNP chip increased the number of detected QTL and the estimated proportion of phenotypic variance explained by these QTL, especially when dosage scores were used instead of genotype scores. Thus, iWGS, even with poor imputation accuracy, can be used to identify possible interesting regions for fine mapping. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-019-0445-y) contains supplementary material, which is available to authorized users. BioMed Central 2019-01-24 /pmc/articles/PMC6346588/ /pubmed/30678638 http://dx.doi.org/10.1186/s12711-019-0445-y Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article van den Berg, Sanne Vandenplas, Jérémie van Eeuwijk, Fred A. Bouwman, Aniek C. Lopes, Marcos S. Veerkamp, Roel F. Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title | Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title_full | Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title_fullStr | Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title_full_unstemmed | Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title_short | Imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
title_sort | imputation to whole-genome sequence using multiple pig populations and its use in genome-wide association studies |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6346588/ https://www.ncbi.nlm.nih.gov/pubmed/30678638 http://dx.doi.org/10.1186/s12711-019-0445-y |
work_keys_str_mv | AT vandenbergsanne imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies AT vandenplasjeremie imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies AT vaneeuwijkfreda imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies AT bouwmananiekc imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies AT lopesmarcoss imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies AT veerkamproelf imputationtowholegenomesequenceusingmultiplepigpopulationsanditsuseingenomewideassociationstudies |