Cargando…
Development and validation of a horse reference panel for genotype imputation
BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252005/ https://www.ncbi.nlm.nih.gov/pubmed/35787788 http://dx.doi.org/10.1186/s12711-022-00740-8 |
_version_ | 1784740167189266432 |
---|---|
author | Reich, Paula Falker-Gieske, Clemens Pook, Torsten Tetens, Jens |
author_facet | Reich, Paula Falker-Gieske, Clemens Pook, Torsten Tetens, Jens |
author_sort | Reich, Paula |
collection | PubMed |
description | BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. RESULTS: Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. CONCLUSIONS: The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00740-8. |
format | Online Article Text |
id | pubmed-9252005 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92520052022-07-05 Development and validation of a horse reference panel for genotype imputation Reich, Paula Falker-Gieske, Clemens Pook, Torsten Tetens, Jens Genet Sel Evol Research Article BACKGROUND: Genotype imputation is a cost-effective method to generate sequence-level genotypes for a large number of animals. Its application can improve the power of genomic studies, provided that the accuracy of imputation is sufficiently high. The purpose of this study was to develop an optimal strategy for genotype imputation from genotyping array data to sequence level in German warmblood horses, and to investigate the effect of different factors on the accuracy of imputation. Publicly available whole-genome sequence data from 317 horses of 46 breeds was used to conduct the analyses. RESULTS: Depending on the size and composition of the reference panel, the accuracy of imputation from medium marker density (60K) to sequence level using the software Beagle 5.1 ranged from 0.64 to 0.70 for horse chromosome 3. Generally, imputation accuracy increased as the size of the reference panel increased, but if genetically distant individuals were included in the panel, the accuracy dropped. Imputation was most precise when using a reference panel of multiple but related breeds and the software Beagle 5.1, which outperformed the other two tested computer programs, Impute 5 and Minimac 4. Genome-wide imputation for this scenario resulted in a mean accuracy of 0.66. Stepwise imputation from 60K to 670K markers and subsequently to sequence level did not improve the accuracy of imputation. However, imputation from higher density (670K) was considerably more accurate (about 0.90) than from medium density. Likewise, imputation in genomic regions with a low marker coverage resulted in a reduced accuracy of imputation. CONCLUSIONS: The accuracy of imputation in horses was influenced by the size and composition of the reference panel, the marker density of the genotyping array, and the imputation software. Genotype imputation can be used to extend the limited amount of available sequence-level data from horses in order to boost the power of downstream analyses, such as genome-wide association studies, or the detection of embryonic lethal variants. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12711-022-00740-8. BioMed Central 2022-07-04 /pmc/articles/PMC9252005/ /pubmed/35787788 http://dx.doi.org/10.1186/s12711-022-00740-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Reich, Paula Falker-Gieske, Clemens Pook, Torsten Tetens, Jens Development and validation of a horse reference panel for genotype imputation |
title | Development and validation of a horse reference panel for genotype imputation |
title_full | Development and validation of a horse reference panel for genotype imputation |
title_fullStr | Development and validation of a horse reference panel for genotype imputation |
title_full_unstemmed | Development and validation of a horse reference panel for genotype imputation |
title_short | Development and validation of a horse reference panel for genotype imputation |
title_sort | development and validation of a horse reference panel for genotype imputation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9252005/ https://www.ncbi.nlm.nih.gov/pubmed/35787788 http://dx.doi.org/10.1186/s12711-022-00740-8 |
work_keys_str_mv | AT reichpaula developmentandvalidationofahorsereferencepanelforgenotypeimputation AT falkergieskeclemens developmentandvalidationofahorsereferencepanelforgenotypeimputation AT pooktorsten developmentandvalidationofahorsereferencepanelforgenotypeimputation AT tetensjens developmentandvalidationofahorsereferencepanelforgenotypeimputation |