Cargando…

Development and validation of a horse reference panel for genotype imputation

BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal...

Descripción completa

Detalles Bibliográficos
Autores principales: Reich, Paula, Falker-Gieske, Clemens, Pook, Torsten, Tetens, Jens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252005/
https://www.ncbi.nlm.nih.gov/pubmed/35787788
http://dx.doi.org/10.1186/s12711-022-00740-8
_version_ 1784740167189266432
author Reich, Paula
Falker-Gieske, Clemens
Pook, Torsten
Tetens, Jens
author_facet Reich, Paula
Falker-Gieske, Clemens
Pook, Torsten
Tetens, Jens
author_sort Reich, Paula
collection PubMed
description BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. RESULTS: Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. CONCLUSIONS: The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00740-8.
format Online
Article
Text
id pubmed-9252005
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92520052022-07-05 Development and validation of a horse reference panel for genotype imputation Reich, Paula Falker-Gieske, Clemens Pook, Torsten Tetens, Jens Genet Sel Evol Research Article BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. RESULTS: Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. CONCLUSIONS: The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00740-8. BioMed Central 2022-07-04 /pmc/articles/PMC9252005/ /pubmed/35787788 http://dx.doi.org/10.1186/s12711-022-00740-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Reich, Paula
Falker-Gieske, Clemens
Pook, Torsten
Tetens, Jens
Development and validation of a horse reference panel for genotype imputation
title Development and validation of a horse reference panel for genotype imputation
title_full Development and validation of a horse reference panel for genotype imputation
title_fullStr Development and validation of a horse reference panel for genotype imputation
title_full_unstemmed Development and validation of a horse reference panel for genotype imputation
title_short Development and validation of a horse reference panel for genotype imputation
title_sort development and validation of a horse reference panel for genotype imputation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252005/
https://www.ncbi.nlm.nih.gov/pubmed/35787788
http://dx.doi.org/10.1186/s12711-022-00740-8
work_keys_str_mv AT reichpaula developmentandvalidationofahorsereferencepanelforgenotypeimputation
AT falkergieskeclemens developmentandvalidationofahorsereferencepanelforgenotypeimputation
AT pooktorsten developmentandvalidationofahorsereferencepanelforgenotypeimputation
AT tetensjens developmentandvalidationofahorsereferencepanelforgenotypeimputation