Cargando…

HKG: an open genetic variant database of 205 Hong Kong cantonese exomes

HKG is the first fully accessible variant database for Hong Kong Cantonese, constructed from 205 novel whole-exome sequencing data. There has long been a research gap in the understanding of the genetic architecture of southern Chinese subgroups, including Hong Kong Cantonese. HKG detected 196 325 h...

Descripción completa

Detalles Bibliográficos
Autores principales: Ou, Min, Leung, Henry Chi-Ming, Leung, Amy Wing-Sze, Luk, Ho-Ming, Yan, Bin, Liu, Chi-Man, Tong, Tony Ming-For, Mok, Myth Tsz-Shun, Ko, Wallace Ming-Yuen, Law, Wai-Chun, Lam, Tak-Wah, Lo, Ivan Fai-Man, Luo, Ruibang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8826781/
https://www.ncbi.nlm.nih.gov/pubmed/35156024
http://dx.doi.org/10.1093/nargab/lqac005
Descripción
Sumario:HKG is the first fully accessible variant database for Hong Kong Cantonese, constructed from 205 novel whole-exome sequencing data. There has long been a research gap in the understanding of the genetic architecture of southern Chinese subgroups, including Hong Kong Cantonese. HKG detected 196 325 high-quality variants with 5.93% being novel, and 25 472 variants were found to be unique in HKG compared to three Chinese populations sampled from 1000 Genomes (CHN). PCA illustrates the uniqueness of HKG in CHN, and the admixture study estimated the ancestral composition of HKG and CHN, with a gradient change from north to south, consistent with their geological distribution. ClinVar, CIViC and PharmGKB annotated 599 clinically significant variants and 360 putative loss-of-function variants, substantiating our understanding of population characteristics for future medical development. Among the novel variants, 96.57% were singleton and 6.85% were of high impact. With a good representation of Hong Kong Cantonese, we demonstrated better variant imputation using reference with the addition of HKG data, thus successfully filling the data gap in southern Chinese to facilitate the regional and global development of population genetics.