Sequence and analysis of a whole genome from Kuwaiti population subgroup of Persian ancestry

BACKGROUND: The 1000 Genomes Project paved the way for sequencing diverse human populations. New genome projects are being established to sequence underrepresented populations, helping to understand human genetic diversity. The Kuwait Genome Project is an initiative to sequence individual genomes from...

Bibliographic Details
Main authors: Thareja, Gaurav; John, Sumi Elsa; Hebbar, Prashantha; Behbehani, Kazem; Thanaraj, Thangavel Alphonse; Alsmadi, Osama
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4336699/
https://www.ncbi.nlm.nih.gov/pubmed/25765185
http://dx.doi.org/10.1186/s12864-015-1233-x
author Thareja, Gaurav
John, Sumi Elsa
Hebbar, Prashantha
Behbehani, Kazem
Thanaraj, Thangavel Alphonse
Alsmadi, Osama
collection PubMed
description BACKGROUND: The 1000 Genomes Project paved the way for sequencing diverse human populations. New genome projects are being established to sequence underrepresented populations, helping to understand human genetic diversity. The Kuwait Genome Project is an initiative to sequence individual genomes from the three subgroups of the Kuwaiti population, namely Saudi Arabian tribe, “tent-dwelling” Bedouin, and Persian, which attribute their ancestry to different regions of the Arabian Peninsula and to modern-day Iran (West Asia). These subgroups are in line with settlement history and are confirmed by genetic studies. In this work, we report the whole genome sequence of a Kuwaiti native from the Persian subgroup at >37X coverage. RESULTS: We document 3,573,824 SNPs, 404,090 insertions/deletions, and 11,138 structural variations. Of the reported SNPs and indels, 85,939 are novel. We identify 295 ‘loss-of-function’ and 2,314 ‘deleterious’ coding variants, some of which carry homozygous genotypes in the sequenced genome; the associated phenotypes include pharmacogenomic traits such as greater triglyceride-lowering ability with fenofibrate treatment and a requirement for a high warfarin dosage to elicit an anticoagulation response. In addition, 6,328 non-coding SNPs associate with 811 phenotype traits. In congruence with the participant’s medical history of Type 2 diabetes and β-Thalassemia, and the family’s history of migraine, the genome carries 72 (of 159 known) Type 2 diabetes, 3 (of 4) β-Thalassemia, and 76 (of 169) migraine variants. Intergenome comparisons based on shared disease-causing variants position the sequenced genome between Asian and European genomes, in congruence with the geographical location of the region. On comparison, bead arrays perform better than sequencing platforms in correctly calling genotypes in low-coverage regions of the sequenced genome; however, a novel SNP or indel near the genotype-calling position can lead to false calls with bead arrays.
CONCLUSIONS: We report, for the first time, a reference genome resource for the population of Persian ancestry. The resource provides a starting point for designing large-scale genetic studies in the Peninsula, including Kuwait, and in the Persian population. Such efforts on populations under-represented in global genome variation surveys help augment current knowledge of human genome diversity. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1233-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4336699
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4336699 2015-02-23 Sequence and analysis of a whole genome from Kuwaiti population subgroup of Persian ancestry Thareja, Gaurav John, Sumi Elsa Hebbar, Prashantha Behbehani, Kazem Thanaraj, Thangavel Alphonse Alsmadi, Osama BMC Genomics Research Article BioMed Central 2015-02-18 /pmc/articles/PMC4336699/ /pubmed/25765185 http://dx.doi.org/10.1186/s12864-015-1233-x Text en
© Thareja et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Sequence and analysis of a whole genome from Kuwaiti population subgroup of Persian ancestry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4336699/
https://www.ncbi.nlm.nih.gov/pubmed/25765185
http://dx.doi.org/10.1186/s12864-015-1233-x