Cargando…

Copy number variations in the genome of the Qatari population

BACKGROUND: The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid...

Descripción completa

Detalles Bibliográficos
Autores principales: Fakhro, Khalid A., Yousri, Noha A., Rodriguez-Flores, Juan L., Robay, Amal, Staudt, Michelle R., Agosto-Perez, Francisco, Salit, Jacqueline, Malek, Joel A., Suhre, Karsten, Jayyousi, Amin, Zirie, Mahmoud, Stadler, Dora, Mezey, Jason G., Crystal, Ronald G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4618522/
https://www.ncbi.nlm.nih.gov/pubmed/26490036
http://dx.doi.org/10.1186/s12864-015-1991-5
_version_ 1782396935902068736
author Fakhro, Khalid A.
Yousri, Noha A.
Rodriguez-Flores, Juan L.
Robay, Amal
Staudt, Michelle R.
Agosto-Perez, Francisco
Salit, Jacqueline
Malek, Joel A.
Suhre, Karsten
Jayyousi, Amin
Zirie, Mahmoud
Stadler, Dora
Mezey, Jason G.
Crystal, Ronald G.
author_facet Fakhro, Khalid A.
Yousri, Noha A.
Rodriguez-Flores, Juan L.
Robay, Amal
Staudt, Michelle R.
Agosto-Perez, Francisco
Salit, Jacqueline
Malek, Joel A.
Suhre, Karsten
Jayyousi, Amin
Zirie, Mahmoud
Stadler, Dora
Mezey, Jason G.
Crystal, Ronald G.
author_sort Fakhro, Khalid A.
collection PubMed
description BACKGROUND: The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid approach that integrates array genotyping intensity data and next-generation sequencing reads to call CNVs in the Qatari population. METHODS: CNVs were detected in 97 unrelated Qatari individuals by running two calling algorithms on each of two primary datasets: high-resolution genotyping (Illumina Omni 2.5M) and high depth whole-genome sequencing (Illumina PE 100bp). The four call-sets were integrated to identify high confidence CNV regions, which were subsequently annotated for putative functional effect and compared to public databases of CNVs in other populations. The availability of genome sequence was leveraged to identify tagging SNPs in high LD with common deletions in this population, enabling their imputation from genotyping experiments in the future. RESULTS: Genotyping intensities and genome sequencing data from 97 Qataris were analyzed with four different algorithms and integrated to discover 16,660 high confidence CNV regions (CNVRs) in the total population, affecting ~28 Mb in the median Qatari genome. Up to 40 % of all CNVs affected genes, including novel CNVs affecting Mendelian disease genes, segregating at different frequencies in the 3 major Qatari subpopulations, including those with Bedouin, Persian/South Asian, and African ancestry. Consistent with high consanguinity levels in the Bedouin subpopulation, we found an increased burden for homozygous deletions in this group. In comparison to known CNVs in the comprehensive Database of Genomic Variants, we found that 5 % of all CNVRs in Qataris were completely novel, with an enrichment of CNVs affecting several known chromosomal disorder loci and genes known to regulate sugar metabolism and type 2 diabetes in the Qatari cohort. Finally, we leveraged the availability of genome sequence to find suitable tagging SNPs for common deletions in this population. CONCLUSION: We combine four independently generated datasets from 97 individuals to study CNVs for the first time at high-resolution in a Gulf Arab population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1991-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4618522
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46185222015-10-25 Copy number variations in the genome of the Qatari population Fakhro, Khalid A. Yousri, Noha A. Rodriguez-Flores, Juan L. Robay, Amal Staudt, Michelle R. Agosto-Perez, Francisco Salit, Jacqueline Malek, Joel A. Suhre, Karsten Jayyousi, Amin Zirie, Mahmoud Stadler, Dora Mezey, Jason G. Crystal, Ronald G. BMC Genomics Research Article BACKGROUND: The populations of the Arabian Peninsula remain the least represented in public genetic databases, both in terms of single nucleotide variants and of larger genomic mutations. We present the first high-resolution copy number variation (CNV) map for a Gulf Arab population, using a hybrid approach that integrates array genotyping intensity data and next-generation sequencing reads to call CNVs in the Qatari population. METHODS: CNVs were detected in 97 unrelated Qatari individuals by running two calling algorithms on each of two primary datasets: high-resolution genotyping (Illumina Omni 2.5M) and high depth whole-genome sequencing (Illumina PE 100bp). The four call-sets were integrated to identify high confidence CNV regions, which were subsequently annotated for putative functional effect and compared to public databases of CNVs in other populations. The availability of genome sequence was leveraged to identify tagging SNPs in high LD with common deletions in this population, enabling their imputation from genotyping experiments in the future. RESULTS: Genotyping intensities and genome sequencing data from 97 Qataris were analyzed with four different algorithms and integrated to discover 16,660 high confidence CNV regions (CNVRs) in the total population, affecting ~28 Mb in the median Qatari genome. Up to 40 % of all CNVs affected genes, including novel CNVs affecting Mendelian disease genes, segregating at different frequencies in the 3 major Qatari subpopulations, including those with Bedouin, Persian/South Asian, and African ancestry. Consistent with high consanguinity levels in the Bedouin subpopulation, we found an increased burden for homozygous deletions in this group. In comparison to known CNVs in the comprehensive Database of Genomic Variants, we found that 5 % of all CNVRs in Qataris were completely novel, with an enrichment of CNVs affecting several known chromosomal disorder loci and genes known to regulate sugar metabolism and type 2 diabetes in the Qatari cohort. Finally, we leveraged the availability of genome sequence to find suitable tagging SNPs for common deletions in this population. CONCLUSION: We combine four independently generated datasets from 97 individuals to study CNVs for the first time at high-resolution in a Gulf Arab population. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1991-5) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-22 /pmc/articles/PMC4618522/ /pubmed/26490036 http://dx.doi.org/10.1186/s12864-015-1991-5 Text en © Fakhro et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Fakhro, Khalid A.
Yousri, Noha A.
Rodriguez-Flores, Juan L.
Robay, Amal
Staudt, Michelle R.
Agosto-Perez, Francisco
Salit, Jacqueline
Malek, Joel A.
Suhre, Karsten
Jayyousi, Amin
Zirie, Mahmoud
Stadler, Dora
Mezey, Jason G.
Crystal, Ronald G.
Copy number variations in the genome of the Qatari population
title Copy number variations in the genome of the Qatari population
title_full Copy number variations in the genome of the Qatari population
title_fullStr Copy number variations in the genome of the Qatari population
title_full_unstemmed Copy number variations in the genome of the Qatari population
title_short Copy number variations in the genome of the Qatari population
title_sort copy number variations in the genome of the qatari population
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4618522/
https://www.ncbi.nlm.nih.gov/pubmed/26490036
http://dx.doi.org/10.1186/s12864-015-1991-5
work_keys_str_mv AT fakhrokhalida copynumbervariationsinthegenomeoftheqataripopulation
AT yousrinohaa copynumbervariationsinthegenomeoftheqataripopulation
AT rodriguezfloresjuanl copynumbervariationsinthegenomeoftheqataripopulation
AT robayamal copynumbervariationsinthegenomeoftheqataripopulation
AT staudtmicheller copynumbervariationsinthegenomeoftheqataripopulation
AT agostoperezfrancisco copynumbervariationsinthegenomeoftheqataripopulation
AT salitjacqueline copynumbervariationsinthegenomeoftheqataripopulation
AT malekjoela copynumbervariationsinthegenomeoftheqataripopulation
AT suhrekarsten copynumbervariationsinthegenomeoftheqataripopulation
AT jayyousiamin copynumbervariationsinthegenomeoftheqataripopulation
AT ziriemahmoud copynumbervariationsinthegenomeoftheqataripopulation
AT stadlerdora copynumbervariationsinthegenomeoftheqataripopulation
AT mezeyjasong copynumbervariationsinthegenomeoftheqataripopulation
AT crystalronaldg copynumbervariationsinthegenomeoftheqataripopulation