A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study
Precision medicine depends on high-accuracy individual-level genotype data. However, whole-genome sequencing (WGS) is still not suitable for very large studies due to budget constraints. It is therefore particularly important to construct a highly accurate haplotype reference panel for genotype imputation. In...
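Main authors: | Yu, Canqing; Lan, Xianmei; Tao, Ye; Guo, Yu; Sun, Dianjianyi; Qian, Puyi; Zhou, Yuwen; Walters, Robin G; Li, Linxuan; Zhu, Yunqing; Zeng, Jingyu; Millwood, Iona Y; Guo, Ruidong; Pei, Pei; Yang, Tao; Du, Huaidong; Yang, Fan; Yang, Ling; Ren, Fangyi; Chen, Yiping; Chen, Fengzhen; Jiang, Xiaosen; Ye, Zhiqiang; Dai, Lanlan; Wei, Xiaofeng; Xu, Xun; Yang, Huanming; Wang, Jian; Chen, Zhengming; Zhu, Huanhuan; Lv, Jun; Jin, Xin; Li, Liming |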
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2023 |
Subjects: | Genomics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10681741/ https://www.ncbi.nlm.nih.gov/pubmed/37870428 http://dx.doi.org/10.1093/nar/gkad779 |
_version_ | 1785142609996414976 |
---|---|
author | Yu, Canqing; Lan, Xianmei; Tao, Ye; Guo, Yu; Sun, Dianjianyi; Qian, Puyi; Zhou, Yuwen; Walters, Robin G; Li, Linxuan; Zhu, Yunqing; Zeng, Jingyu; Millwood, Iona Y; Guo, Ruidong; Pei, Pei; Yang, Tao; Du, Huaidong; Yang, Fan; Yang, Ling; Ren, Fangyi; Chen, Yiping; Chen, Fengzhen; Jiang, Xiaosen; Ye, Zhiqiang; Dai, Lanlan; Wei, Xiaofeng; Xu, Xun; Yang, Huanming; Wang, Jian; Chen, Zhengming; Zhu, Huanhuan; Lv, Jun; Jin, Xin; Li, Liming |
author_sort | Yu, Canqing |
collection | PubMed |
description | Precision medicine depends on high-accuracy individual-level genotype data. However, whole-genome sequencing (WGS) is still not suitable for very large studies due to budget constraints. It is therefore particularly important to construct a highly accurate haplotype reference panel for genotype imputation. In this study, we used 10 000 samples with medium-depth WGS to construct a reference panel that we named the CKB reference panel. Imputation of microarray datasets showed that the CKB panel outperformed the other panels compared, in terms of both the number of well-imputed variants and imputation accuracy. In addition, we have completed the imputation of 100 706 microarrays with the CKB panel, and the imputed data are the largest whole-genome dataset of the Chinese population to date. Furthermore, in a GWAS of height, a real phenotype, the number of tested SNPs tripled and the number of significant SNPs doubled after imputation. Finally, we developed an online server that offers a free genotype imputation service based on the CKB reference panel (https://db.cngb.org/imputation/). We believe that the CKB panel is of great value for imputing microarray or low-coverage genotype data from the Chinese population, and potentially from mixed populations. The imputed data for the 100 706 microarrays are a large and valuable resource for population genetic studies of complex traits and diseases. |
format | Online Article Text |
id | pubmed-10681741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-10681741 2023-10-23 A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study. Yu, Canqing; Lan, Xianmei; Tao, Ye; Guo, Yu; Sun, Dianjianyi; Qian, Puyi; Zhou, Yuwen; Walters, Robin G; Li, Linxuan; Zhu, Yunqing; Zeng, Jingyu; Millwood, Iona Y; Guo, Ruidong; Pei, Pei; Yang, Tao; Du, Huaidong; Yang, Fan; Yang, Ling; Ren, Fangyi; Chen, Yiping; Chen, Fengzhen; Jiang, Xiaosen; Ye, Zhiqiang; Dai, Lanlan; Wei, Xiaofeng; Xu, Xun; Yang, Huanming; Wang, Jian; Chen, Zhengming; Zhu, Huanhuan; Lv, Jun; Jin, Xin; Li, Liming. Nucleic Acids Res. Genomics. Oxford University Press 2023-10-23 /pmc/articles/PMC10681741/ /pubmed/37870428 http://dx.doi.org/10.1093/nar/gkad779 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | A high-resolution haplotype-resolved Reference panel constructed from the China Kadoorie Biobank Study |
title_sort | high-resolution haplotype-resolved reference panel constructed from the china kadoorie biobank study |
topic | Genomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10681741/ https://www.ncbi.nlm.nih.gov/pubmed/37870428 http://dx.doi.org/10.1093/nar/gkad779 |