Cargando…

High level of inbreeding in final phase of 1000 Genomes Project

The 1000 Genomes Project provides a unique source of whole genome sequencing data for studies of human population genetics and human diseases. The last release of this project includes more than 2,500 sequenced individuals from 26 populations. Although relationships among individuals have been inves...

Descripción completa

Detalles Bibliográficos
Autores principales: Gazal, Steven, Sahbatou, Mourad, Babron, Marie-Claude, Génin, Emmanuelle, Leutenegger, Anne-Louise
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667178/
https://www.ncbi.nlm.nih.gov/pubmed/26625947
http://dx.doi.org/10.1038/srep17453
_version_ 1782403794004344832
author Gazal, Steven
Sahbatou, Mourad
Babron, Marie-Claude
Génin, Emmanuelle
Leutenegger, Anne-Louise
author_facet Gazal, Steven
Sahbatou, Mourad
Babron, Marie-Claude
Génin, Emmanuelle
Leutenegger, Anne-Louise
author_sort Gazal, Steven
collection PubMed
description The 1000 Genomes Project provides a unique source of whole genome sequencing data for studies of human population genetics and human diseases. The last release of this project includes more than 2,500 sequenced individuals from 26 populations. Although relationships among individuals have been investigated in some of the populations, inbreeding has never been studied. In this article, we estimated the genomic inbreeding coefficient of each individual and found an unexpected high level of inbreeding in 1000 Genomes data: nearly a quarter of the individuals were inbred and around 4% of them had inbreeding coefficients similar or greater than the ones expected for first-cousin offspring. Inbred individuals were found in each of the 26 populations, with some populations showing proportions of inbred individuals above 50%. We also detected 227 previously unreported pairs of close relatives (up to and including first-cousins). Thus, we propose subsets of unrelated and outbred individuals, for use by the scientific community. In addition, because admixed populations are present in the 1000 Genomes Project, we performed simulations to study the robustness of inbreeding coefficient estimates in the presence of admixture. We found that our multi-point approach (FSuite) was quite robust to admixture, unlike single-point methods (PLINK).
format Online
Article
Text
id pubmed-4667178
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-46671782015-12-03 High level of inbreeding in final phase of 1000 Genomes Project Gazal, Steven Sahbatou, Mourad Babron, Marie-Claude Génin, Emmanuelle Leutenegger, Anne-Louise Sci Rep Article The 1000 Genomes Project provides a unique source of whole genome sequencing data for studies of human population genetics and human diseases. The last release of this project includes more than 2,500 sequenced individuals from 26 populations. Although relationships among individuals have been investigated in some of the populations, inbreeding has never been studied. In this article, we estimated the genomic inbreeding coefficient of each individual and found an unexpected high level of inbreeding in 1000 Genomes data: nearly a quarter of the individuals were inbred and around 4% of them had inbreeding coefficients similar or greater than the ones expected for first-cousin offspring. Inbred individuals were found in each of the 26 populations, with some populations showing proportions of inbred individuals above 50%. We also detected 227 previously unreported pairs of close relatives (up to and including first-cousins). Thus, we propose subsets of unrelated and outbred individuals, for use by the scientific community. In addition, because admixed populations are present in the 1000 Genomes Project, we performed simulations to study the robustness of inbreeding coefficient estimates in the presence of admixture. We found that our multi-point approach (FSuite) was quite robust to admixture, unlike single-point methods (PLINK). Nature Publishing Group 2015-12-02 /pmc/articles/PMC4667178/ /pubmed/26625947 http://dx.doi.org/10.1038/srep17453 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Gazal, Steven
Sahbatou, Mourad
Babron, Marie-Claude
Génin, Emmanuelle
Leutenegger, Anne-Louise
High level of inbreeding in final phase of 1000 Genomes Project
title High level of inbreeding in final phase of 1000 Genomes Project
title_full High level of inbreeding in final phase of 1000 Genomes Project
title_fullStr High level of inbreeding in final phase of 1000 Genomes Project
title_full_unstemmed High level of inbreeding in final phase of 1000 Genomes Project
title_short High level of inbreeding in final phase of 1000 Genomes Project
title_sort high level of inbreeding in final phase of 1000 genomes project
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4667178/
https://www.ncbi.nlm.nih.gov/pubmed/26625947
http://dx.doi.org/10.1038/srep17453
work_keys_str_mv AT gazalsteven highlevelofinbreedinginfinalphaseof1000genomesproject
AT sahbatoumourad highlevelofinbreedinginfinalphaseof1000genomesproject
AT babronmarieclaude highlevelofinbreedinginfinalphaseof1000genomesproject
AT geninemmanuelle highlevelofinbreedinginfinalphaseof1000genomesproject
AT leuteneggerannelouise highlevelofinbreedinginfinalphaseof1000genomesproject