Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle

Recently, the cattle genome sequence has been completed, followed by developing a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. In order to increase statistical power for detecting quantitative trait locus (QTL), a number of animals should be genotyped. Ho...

Bibliographic Details
Main Authors: Lee, DooHo, Kim, Yeongkuk, Chung, Yoonji, Lee, Dongjae, Seo, Dongwon, Choi, Tae Jeong, Lim, Dajeong, Yoon, Duhak, Lee, Seung Hwan
Format: Online Article Text
Language: English
Published: Korean Society of Animal Sciences and Technology 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672260/
https://www.ncbi.nlm.nih.gov/pubmed/34957440
http://dx.doi.org/10.5187/jast.2021.e117
_version_ 1784615323429765120
author Lee, DooHo
Kim, Yeongkuk
Chung, Yoonji
Lee, Dongjae
Seo, Dongwon
Choi, Tae Jeong
Lim, Dajeong
Yoon, Duhak
Lee, Seung Hwan
author_facet Lee, DooHo
Kim, Yeongkuk
Chung, Yoonji
Lee, Dongjae
Seo, Dongwon
Choi, Tae Jeong
Lim, Dajeong
Yoon, Duhak
Lee, Seung Hwan
author_sort Lee, DooHo
collection PubMed
description The cattle genome sequence has recently been completed, and commercial single nucleotide polymorphism (SNP) chip panels have since been developed for the animal genomics industry. Increasing the statistical power to detect quantitative trait loci (QTL) requires genotyping a large number of animals. However, genotyping many animals with a high-density chip increases the cost. Statistical imputation of genotypes from a low-density chip to a high-density chip is therefore useful in the animal industry. The purpose of this study was to investigate the effect of reference population size and marker density on imputation accuracy and to suggest an appropriate reference population size for imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data at different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation data consisted of four validation sets (total 889) at different marker densities (5k [5,000], 10k [10,000], and 15k [15,000]). Imputation accuracy was calculated by directly comparing the true genotypes with the imputed genotypes. With the lowest marker density (5k) in the validation set, imputation accuracy ranged from 0.793 to 0.929 depending on the reference population size. With the highest marker density (15k), it ranged from 0.904 to 0.967. Moreover, the reference population size should exceed 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle.
format Online
Article
Text
id pubmed-8672260
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Korean Society of Animal Sciences and Technology
record_format MEDLINE/PubMed
spelling pubmed-86722602021-12-23 Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle Lee, DooHo Kim, Yeongkuk Chung, Yoonji Lee, Dongjae Seo, Dongwon Choi, Tae Jeong Lim, Dajeong Yoon, Duhak Lee, Seung Hwan J Anim Sci Technol Research Article Recently, the cattle genome sequence has been completed, followed by developing a commercial single nucleotide polymorphism (SNP) chip panel in the animal genome industry. In order to increase statistical power for detecting quantitative trait locus (QTL), a number of animals should be genotyped. However, a high-density chip for many animals would be increasing the genotyping cost. Therefore, statistical inference of genotype imputation (low-density chip to high-density) will be useful in the animal industry. The purpose of this study is to investigate the effect of the reference population size and marker density on the imputation accuracy and to suggest the appropriate number of reference population sets for the imputation in Hanwoo cattle. A total of 3,821 Hanwoo cattle were divided into reference and validation populations. The reference sets consisted of 50k (38,916) marker data and different population sizes (500, 1,000, 1,500, 2,000, and 3,600). The validation sets consisted of four validation sets (Total 889) and the different marker density (5k [5,000], 10k [10,000], and 15k [15,000]). The accuracy of imputation was calculated by direct comparison of the true genotype and the imputed genotype. In conclusion, when the lowest marker density (5k) was used in the validation set, according to the reference population size, the imputation accuracy was 0.793 to 0.929. On the other hand, when the highest marker density (15k), according to the reference population size, the imputation accuracy was 0.904 to 0.967. Moreover, the reference population size should be more than 1,000 to obtain at least 88% imputation accuracy in Hanwoo cattle. 
Korean Society of Animal Sciences and Technology 2021-11 2021-11-30 /pmc/articles/PMC8672260/ /pubmed/34957440 http://dx.doi.org/10.5187/jast.2021.e117 Text en © Copyright 2021 Korean Society of Animal Science and Technology https://creativecommons.org/licenses/by-nc/4.0/ This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lee, DooHo
Kim, Yeongkuk
Chung, Yoonji
Lee, Dongjae
Seo, Dongwon
Choi, Tae Jeong
Lim, Dajeong
Yoon, Duhak
Lee, Seung Hwan
Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title_full Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title_fullStr Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title_full_unstemmed Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title_short Accuracy of genotype imputation based on reference population size and marker density in Hanwoo cattle
title_sort accuracy of genotype imputation based on reference population size and marker density in hanwoo cattle
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672260/
https://www.ncbi.nlm.nih.gov/pubmed/34957440
http://dx.doi.org/10.5187/jast.2021.e117
work_keys_str_mv AT leedooho accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT kimyeongkuk accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT chungyoonji accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT leedongjae accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT seodongwon accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT choitaejeong accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT limdajeong accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT yoonduhak accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle
AT leeseunghwan accuracyofgenotypeimputationbasedonreferencepopulationsizeandmarkerdensityinhanwoocattle