Cargando…

Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing

BACKGROUND: Large-scale human sequencing projects have described around a hundred-million single nucleotide variants (SNVs). These studies have predominately involved individuals with European ancestry despite the fact that genetic diversity is expected to be highest in Africa where Homo sapiens evo...

Descripción completa

Detalles Bibliográficos
Autores principales: Crooks, Lucy, Cooper-Knock, Johnathan, Heath, Paul R., Bouhouche, Ahmed, Elfahime, Mostafa, Azzouz, Mimoun, Bakri, Youssef, Adnaoui, Mohammed, Ibrahimi, Azeddine, Amzazi, Saaïd, Tazi-Ahnini, Rachid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507649/
https://www.ncbi.nlm.nih.gov/pubmed/32957965
http://dx.doi.org/10.1186/s12863-020-00917-4
_version_ 1783585271386734592
author Crooks, Lucy
Cooper-Knock, Johnathan
Heath, Paul R.
Bouhouche, Ahmed
Elfahime, Mostafa
Azzouz, Mimoun
Bakri, Youssef
Adnaoui, Mohammed
Ibrahimi, Azeddine
Amzazi, Saaïd
Tazi-Ahnini, Rachid
author_facet Crooks, Lucy
Cooper-Knock, Johnathan
Heath, Paul R.
Bouhouche, Ahmed
Elfahime, Mostafa
Azzouz, Mimoun
Bakri, Youssef
Adnaoui, Mohammed
Ibrahimi, Azeddine
Amzazi, Saaïd
Tazi-Ahnini, Rachid
author_sort Crooks, Lucy
collection PubMed
description BACKGROUND: Large-scale human sequencing projects have described around a hundred-million single nucleotide variants (SNVs). These studies have predominately involved individuals with European ancestry despite the fact that genetic diversity is expected to be highest in Africa where Homo sapiens evolved and has maintained a large population for the longest time. The African Genome Variation Project examined several African populations but these were all located south of the Sahara. Morocco is on the northwest coast of Africa and mostly lies north of the Sahara, which makes it very attractive for studying genetic diversity. The ancestry of present-day Moroccans is unknown and may be substantially different from Africans found South of the Sahara desert, Recent genomic data of Taforalt individuals in Eastern Morocco revealed 15,000-year-old modern humans and suggested that North African individuals may be genetically distinct from previously studied African populations. RESULTS: We present SNVs discovered by whole genome sequencing (WGS) of three Moroccans. From a total of 5.9 million SNVs detected, over 200,000 were not identified by 1000G and were not in the extensive gnomAD database. We summarise the SNVs by genomic position, type of sequence gene context and effect on proteins encoded by the sequence. Analysis of the overall genomic information of the Moroccan individuals to individuals from 1000G supports the Moroccan population being distinct from both sub-Saharan African and European populations. CONCLUSIONS: We conclude that Moroccan samples are genetically distinct and lie in the middle of the previously observed cline between populations of European and African ancestry. WGS of Moroccan individuals can identify a large number of novel SNVs and aid in functional characterisation of the genome.
format Online
Article
Text
id pubmed-7507649
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75076492020-09-23 Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing Crooks, Lucy Cooper-Knock, Johnathan Heath, Paul R. Bouhouche, Ahmed Elfahime, Mostafa Azzouz, Mimoun Bakri, Youssef Adnaoui, Mohammed Ibrahimi, Azeddine Amzazi, Saaïd Tazi-Ahnini, Rachid BMC Genet Research Article BACKGROUND: Large-scale human sequencing projects have described around a hundred-million single nucleotide variants (SNVs). These studies have predominately involved individuals with European ancestry despite the fact that genetic diversity is expected to be highest in Africa where Homo sapiens evolved and has maintained a large population for the longest time. The African Genome Variation Project examined several African populations but these were all located south of the Sahara. Morocco is on the northwest coast of Africa and mostly lies north of the Sahara, which makes it very attractive for studying genetic diversity. The ancestry of present-day Moroccans is unknown and may be substantially different from Africans found South of the Sahara desert, Recent genomic data of Taforalt individuals in Eastern Morocco revealed 15,000-year-old modern humans and suggested that North African individuals may be genetically distinct from previously studied African populations. RESULTS: We present SNVs discovered by whole genome sequencing (WGS) of three Moroccans. From a total of 5.9 million SNVs detected, over 200,000 were not identified by 1000G and were not in the extensive gnomAD database. We summarise the SNVs by genomic position, type of sequence gene context and effect on proteins encoded by the sequence. Analysis of the overall genomic information of the Moroccan individuals to individuals from 1000G supports the Moroccan population being distinct from both sub-Saharan African and European populations. CONCLUSIONS: We conclude that Moroccan samples are genetically distinct and lie in the middle of the previously observed cline between populations of European and African ancestry. WGS of Moroccan individuals can identify a large number of novel SNVs and aid in functional characterisation of the genome. BioMed Central 2020-09-21 /pmc/articles/PMC7507649/ /pubmed/32957965 http://dx.doi.org/10.1186/s12863-020-00917-4 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Crooks, Lucy
Cooper-Knock, Johnathan
Heath, Paul R.
Bouhouche, Ahmed
Elfahime, Mostafa
Azzouz, Mimoun
Bakri, Youssef
Adnaoui, Mohammed
Ibrahimi, Azeddine
Amzazi, Saaïd
Tazi-Ahnini, Rachid
Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title_full Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title_fullStr Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title_full_unstemmed Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title_short Identification of single nucleotide variants in the Moroccan population by whole-genome sequencing
title_sort identification of single nucleotide variants in the moroccan population by whole-genome sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507649/
https://www.ncbi.nlm.nih.gov/pubmed/32957965
http://dx.doi.org/10.1186/s12863-020-00917-4
work_keys_str_mv AT crookslucy identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT cooperknockjohnathan identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT heathpaulr identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT bouhoucheahmed identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT elfahimemostafa identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT azzouzmimoun identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT bakriyoussef identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT adnaouimohammed identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT ibrahimiazeddine identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT amzazisaaid identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing
AT taziahninirachid identificationofsinglenucleotidevariantsinthemoroccanpopulationbywholegenomesequencing