Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
The cattle genome sequence was recently completed, followed by the development of a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. Increasing the statistical power to detect quantitative trait loci (QTL) requires genotyping a large number of animals. Ho...
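Main authors: | Lee, DooHo; Kim, Yeongkuk; Chung, Yoonji; Lee, Dongjae; Seo, Dongwon; Choi, Tae Jeong; Lim, Dajeong; Yoon, Duhak; Lee, Seung Hwan |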
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Korean Society of Animal Sciences and Technology, 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672260/ https://www.ncbi.nlm.nih.gov/pubmed/34957440 http://dx.doi.org/10.5187/jast.2021.e117 |
_version_ | 1784615323429765120 |
---|---|
author | Lee, DooHo Kim, Yeongkuk Chung, Yoonji Lee, Dongjae Seo, Dongwon Choi, Tae Jeong Lim, Dajeong Yoon, Duhak Lee, Seung Hwan |
author_facet | Lee, DooHo Kim, Yeongkuk Chung, Yoonji Lee, Dongjae Seo, Dongwon Choi, Tae Jeong Lim, Dajeong Yoon, Duhak Lee, Seung Hwan |
author_sort | Lee, DooHo |
collection | PubMed |
description | The cattle genome sequence was recently completed, followed by the development of a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. Increasing the statistical power to detect quantitative trait loci (QTL) requires genotyping a large number of animals. However, genotyping many animals with a high-density chip increases the genotyping cost. Therefore, genotype imputation (statistical inference of high-density genotypes from low-density chip data) will be useful in the animal industry. The purpose of this study was to investigate the effect of reference population size and marker density on imputation accuracy and to suggest an appropriate reference population size for imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data at different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation data consisted of four validation sets (total 889) at different marker densities (5k [5,000], 10k [10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct comparison of the true and imputed genotypes. In conclusion, with the lowest marker density (5k) in the validation set, imputation accuracy ranged from 0.793 to 0.929 depending on the reference population size. With the highest marker density (15k), it ranged from 0.904 to 0.967. Moreover, the reference population size should be more than 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle. |
format | Online Article Text |
id | pubmed-8672260 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Korean Society of Animal Sciences and Technology |
record_format | MEDLINE/PubMed |
spelling | pubmed-8672260 2021-12-23 Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle Lee, DooHo Kim, Yeongkuk Chung, Yoonji Lee, Dongjae Seo, Dongwon Choi, Tae Jeong Lim, Dajeong Yoon, Duhak Lee, Seung Hwan J Anim Sci Technol Research Article The cattle genome sequence was recently completed, followed by the development of a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. Increasing the statistical power to detect quantitative trait loci (QTL) requires genotyping a large number of animals. However, genotyping many animals with a high-density chip increases the genotyping cost. Therefore, genotype imputation (statistical inference of high-density genotypes from low-density chip data) will be useful in the animal industry. The purpose of this study was to investigate the effect of reference population size and marker density on imputation accuracy and to suggest an appropriate reference population size for imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data at different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation data consisted of four validation sets (total 889) at different marker densities (5k [5,000], 10k [10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct comparison of the true and imputed genotypes. In conclusion, with the lowest marker density (5k) in the validation set, imputation accuracy ranged from 0.793 to 0.929 depending on the reference population size. With the highest marker density (15k), it ranged from 0.904 to 0.967. Moreover, the reference population size should be more than 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle.
Korean Society of Animal Sciences and Technology 2021-11 2021-11-30 /pmc/articles/PMC8672260/ /pubmed/34957440 http://dx.doi.org/10.5187/jast.2021.e117 Text en © Copyright 2021 Korean Society of Animal Science and Technology https://creativecommons.org/licenses/by-nc/4.0/ This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Lee, DooHo Kim, Yeongkuk Chung, Yoonji Lee, Dongjae Seo, Dongwon Choi, Tae Jeong Lim, Dajeong Yoon, Duhak Lee, Seung Hwan Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title | Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title_full | Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title_fullStr | Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title_full_unstemmed | Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title_short | Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle |
title_sort | accuracy of genotype imputation based on reference population size and marker density in hanwoo cattle |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672260/ https://www.ncbi.nlm.nih.gov/pubmed/34957440 http://dx.doi.org/10.5187/jast.2021.e117 |
work_keys_str_mv | AT leedooho accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT kimyeongkuk accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT chungyoonji accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT leedongjae accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT seodongwon accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT choitaejeong accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT limdajeong accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT yoonduhak accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle AT leeseunghwan accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle |
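
The description field above says imputation accuracy was calculated by direct comparison of the true and imputed genotypes. One common way to do that is a concordance rate: the fraction of imputed calls that match the true calls. The sketch below is an illustration under assumptions, not the authors' method. The 0/1/2 genotype coding, the function name `concordance_rate`, the `typed_markers` argument, and the toy data are all hypothetical, and the record does not say whether the paper used concordance or another measure.

```python
from typing import Optional, Sequence, Set


def concordance_rate(
    true_calls: Sequence[Sequence[int]],
    imputed_calls: Sequence[Sequence[int]],
    typed_markers: Optional[Set[int]] = None,
) -> float:
    """Fraction of imputed genotype calls that equal the true calls.

    Rows are animals, columns are markers. Calls are assumed to be
    coded 0/1/2 (copies of one allele). Column indices listed in
    `typed_markers` are skipped, on the assumption that they were
    genotyped on the low-density panel rather than imputed.
    """
    typed = typed_markers or set()
    matches = 0
    compared = 0
    for true_row, imputed_row in zip(true_calls, imputed_calls):
        if len(true_row) != len(imputed_row):
            raise ValueError("rows must cover the same markers")
        for j, (t, i) in enumerate(zip(true_row, imputed_row)):
            if j in typed:
                continue  # observed on the low-density panel, not imputed
            compared += 1
            matches += int(t == i)
    if compared == 0:
        raise ValueError("no imputed markers to compare")
    return matches / compared


# Toy data (made up, not from the study): 3 animals x 5 markers.
true_calls = [[0, 1, 2, 1, 0], [2, 2, 1, 0, 0], [1, 0, 0, 2, 1]]
imputed_calls = [[0, 1, 2, 0, 0], [2, 2, 1, 0, 1], [1, 0, 0, 2, 1]]

# Marker 0 is treated as typed, so 12 calls are compared and 10 match.
rate = concordance_rate(true_calls, imputed_calls, typed_markers={0})
print(round(rate, 3))
```

Excluding the typed markers keeps calls that were observed directly from inflating the score. Whether the study made that exclusion is not stated in this record.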