Cargando…

A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study

Precision medicine depends on high-accuracy individual-level genotype data. However, the whole-genome sequencing (WGS) is still not suitable for gigantic studies due to budget constraints. It is particularly important to construct highly accurate haplotype reference panel for genotype imputation. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Canqing, Lan, Xianmei, Tao, Ye, Guo, Yu, Sun, Dianjianyi, Qian, Puyi, Zhou, Yuwen, Walters, Robin G, Li, Linxuan, Zhu, Yunqing, Zeng, Jingyu, Millwood, Iona Y, Guo, Ruidong, Pei, Pei, Yang, Tao, Du, Huaidong, Yang, Fan, Yang, Ling, Ren, Fangyi, Chen, Yiping, Chen, Fengzhen, Jiang, Xiaosen, Ye, Zhiqiang, Dai, Lanlan, Wei, Xiaofeng, Xu, Xun, Yang, Huanming, Wang, Jian, Chen, Zhengming, Zhu, Huanhuan, Lv, Jun, Jin, Xin, Li, Liming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10681741/
https://www.ncbi.nlm.nih.gov/pubmed/37870428
http://dx.doi.org/10.1093/nar/gkad779
_version_ 1785142609996414976
author Yu, Canqing
Lan, Xianmei
Tao, Ye
Guo, Yu
Sun, Dianjianyi
Qian, Puyi
Zhou, Yuwen
Walters, Robin G
Li, Linxuan
Zhu, Yunqing
Zeng, Jingyu
Millwood, Iona Y
Guo, Ruidong
Pei, Pei
Yang, Tao
Du, Huaidong
Yang, Fan
Yang, Ling
Ren, Fangyi
Chen, Yiping
Chen, Fengzhen
Jiang, Xiaosen
Ye, Zhiqiang
Dai, Lanlan
Wei, Xiaofeng
Xu, Xun
Yang, Huanming
Wang, Jian
Chen, Zhengming
Zhu, Huanhuan
Lv, Jun
Jin, Xin
Li, Liming
author_facet Yu, Canqing
Lan, Xianmei
Tao, Ye
Guo, Yu
Sun, Dianjianyi
Qian, Puyi
Zhou, Yuwen
Walters, Robin G
Li, Linxuan
Zhu, Yunqing
Zeng, Jingyu
Millwood, Iona Y
Guo, Ruidong
Pei, Pei
Yang, Tao
Du, Huaidong
Yang, Fan
Yang, Ling
Ren, Fangyi
Chen, Yiping
Chen, Fengzhen
Jiang, Xiaosen
Ye, Zhiqiang
Dai, Lanlan
Wei, Xiaofeng
Xu, Xun
Yang, Huanming
Wang, Jian
Chen, Zhengming
Zhu, Huanhuan
Lv, Jun
Jin, Xin
Li, Liming
author_sort Yu, Canqing
collection PubMed
description Precision medicine depends on high-accuracy individual-level genotype data. However, the whole-genome sequencing (WGS) is still not suitable for gigantic studies due to budget constraints. It is particularly important to construct highly accurate haplotype reference panel for genotype imputation. In this study, we used 10 000 samples with medium-depth WGS to construct a reference panel that we named the CKB reference panel. By imputing microarray datasets, it showed that the CKB panel outperformed compared panels in terms of both the number of well-imputed variants and imputation accuracy. In addition, we have completed the imputation of 100 706 microarrays with the CKB panel, and the after-imputed data is the hitherto largest whole genome data of the Chinese population. Furthermore, in the GWAS analysis of real phenotype height, the number of tested SNPs tripled and the number of significant SNPs doubled after imputation. Finally, we developed an online server for offering free genotype imputation service based on the CKB reference panel (https://db.cngb.org/imputation/). We believe that the CKB panel is of great value for imputing microarray or low-coverage genotype data of Chinese population, and potentially mixed populations. The imputation-completed 100 706 microarray data are enormous and precious resources of population genetic studies for complex traits and diseases.
format Online
Article
Text
id pubmed-10681741
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-106817412023-10-23 A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study Yu, Canqing Lan, Xianmei Tao, Ye Guo, Yu Sun, Dianjianyi Qian, Puyi Zhou, Yuwen Walters, Robin G Li, Linxuan Zhu, Yunqing Zeng, Jingyu Millwood, Iona Y Guo, Ruidong Pei, Pei Yang, Tao Du, Huaidong Yang, Fan Yang, Ling Ren, Fangyi Chen, Yiping Chen, Fengzhen Jiang, Xiaosen Ye, Zhiqiang Dai, Lanlan Wei, Xiaofeng Xu, Xun Yang, Huanming Wang, Jian Chen, Zhengming Zhu, Huanhuan Lv, Jun Jin, Xin Li, Liming Nucleic Acids Res Genomics Precision medicine depends on high-accuracy individual-level genotype data. However, the whole-genome sequencing (WGS) is still not suitable for gigantic studies due to budget constraints. It is particularly important to construct highly accurate haplotype reference panel for genotype imputation. In this study, we used 10 000 samples with medium-depth WGS to construct a reference panel that we named the CKB reference panel. By imputing microarray datasets, it showed that the CKB panel outperformed compared panels in terms of both the number of well-imputed variants and imputation accuracy. In addition, we have completed the imputation of 100 706 microarrays with the CKB panel, and the after-imputed data is the hitherto largest whole genome data of the Chinese population. Furthermore, in the GWAS analysis of real phenotype height, the number of tested SNPs tripled and the number of significant SNPs doubled after imputation. Finally, we developed an online server for offering free genotype imputation service based on the CKB reference panel (https://db.cngb.org/imputation/). We believe that the CKB panel is of great value for imputing microarray or low-coverage genotype data of Chinese population, and potentially mixed populations. The imputation-completed 100 706 microarray data are enormous and precious resources of population genetic studies for complex traits and diseases. Oxford University Press 2023-10-23 /pmc/articles/PMC10681741/ /pubmed/37870428 http://dx.doi.org/10.1093/nar/gkad779 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genomics
Yu, Canqing
Lan, Xianmei
Tao, Ye
Guo, Yu
Sun, Dianjianyi
Qian, Puyi
Zhou, Yuwen
Walters, Robin G
Li, Linxuan
Zhu, Yunqing
Zeng, Jingyu
Millwood, Iona Y
Guo, Ruidong
Pei, Pei
Yang, Tao
Du, Huaidong
Yang, Fan
Yang, Ling
Ren, Fangyi
Chen, Yiping
Chen, Fengzhen
Jiang, Xiaosen
Ye, Zhiqiang
Dai, Lanlan
Wei, Xiaofeng
Xu, Xun
Yang, Huanming
Wang, Jian
Chen, Zhengming
Zhu, Huanhuan
Lv, Jun
Jin, Xin
Li, Liming
A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title_full A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title_fullStr A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title_full_unstemmed A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title_short A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
title_sort high-resolution haplotype-resolved reference panel constructed from the china kadoorie biobank study
topic Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10681741/
https://www.ncbi.nlm.nih.gov/pubmed/37870428
http://dx.doi.org/10.1093/nar/gkad779
work_keys_str_mv AT yucanqing ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lanxianmei ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT taoye ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT guoyu ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT sundianjianyi ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT qianpuyi ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhouyuwen ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT waltersrobing ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lilinxuan ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhuyunqing ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zengjingyu ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT millwoodionay ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT guoruidong ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT peipei ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangtao ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT duhuaidong ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangfan ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangling ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT renfangyi ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenyiping ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenfengzhen ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT jiangxiaosen ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yezhiqiang ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT dailanlan ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT weixiaofeng ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT xuxun ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yanghuanming ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT wangjian ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenzhengming ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhuhuanhuan ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lvjun ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT jinxin ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT liliming ahighresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yucanqing highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lanxianmei highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT taoye highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT guoyu highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT sundianjianyi highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT qianpuyi highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhouyuwen highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT waltersrobing highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lilinxuan highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhuyunqing highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zengjingyu highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT millwoodionay highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT guoruidong highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT peipei highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangtao highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT duhuaidong highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangfan highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yangling highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT renfangyi highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenyiping highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenfengzhen highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT jiangxiaosen highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yezhiqiang highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT dailanlan highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT weixiaofeng highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT xuxun highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT yanghuanming highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT wangjian highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT chenzhengming highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT zhuhuanhuan highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT lvjun highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT jinxin highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy
AT liliming highresolutionhaplotyperesolvedreferencepanelconstructedfromthechinakadooriebiobankstudy