Cargando…

The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences

Rare variants are important for drawing inference about past demographic events in a species history. A singleton is a rare variant for which genetic variation is carried by a unique chromosome in a sample. How singletons are distributed across geographic space provides a local measure of genetic di...

Descripción completa

Detalles Bibliográficos
Autores principales: Cubry, Philippe, Vigouroux, Yves, François, Olivier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627571/
https://www.ncbi.nlm.nih.gov/pubmed/29033977
http://dx.doi.org/10.3389/fgene.2017.00139
_version_ 1783268739842572288
author Cubry, Philippe
Vigouroux, Yves
François, Olivier
author_facet Cubry, Philippe
Vigouroux, Yves
François, Olivier
author_sort Cubry, Philippe
collection PubMed
description Rare variants are important for drawing inference about past demographic events in a species history. A singleton is a rare variant for which genetic variation is carried by a unique chromosome in a sample. How singletons are distributed across geographic space provides a local measure of genetic diversity that can be measured at the individual level. Here, we define the empirical distribution of singletons in a sample of chromosomes as the proportion of the total number of singletons that each chromosome carries, and we present a theoretical background for studying this distribution. Next, we use computer simulations to evaluate the potential for the empirical distribution of singletons to provide a description of genetic diversity across geographic space. In a Bayesian framework, we show that the empirical distribution of singletons leads to accurate estimates of the geographic origin of range expansions. We apply the Bayesian approach to estimating the origin of the cultivated plant species Pennisetum glaucum [L.] R. Br. (pearl millet) in Africa, and find support for range expansion having started from Northern Mali. Overall, we report that the empirical distribution of singletons is a useful measure to analyze results of sequencing projects based on large scale sampling of individuals across geographic space.
format Online
Article
Text
id pubmed-5627571
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-56275712017-10-13 The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences Cubry, Philippe Vigouroux, Yves François, Olivier Front Genet Genetics Rare variants are important for drawing inference about past demographic events in a species history. A singleton is a rare variant for which genetic variation is carried by a unique chromosome in a sample. How singletons are distributed across geographic space provides a local measure of genetic diversity that can be measured at the individual level. Here, we define the empirical distribution of singletons in a sample of chromosomes as the proportion of the total number of singletons that each chromosome carries, and we present a theoretical background for studying this distribution. Next, we use computer simulations to evaluate the potential for the empirical distribution of singletons to provide a description of genetic diversity across geographic space. In a Bayesian framework, we show that the empirical distribution of singletons leads to accurate estimates of the geographic origin of range expansions. We apply the Bayesian approach to estimating the origin of the cultivated plant species Pennisetum glaucum [L.] R. Br. (pearl millet) in Africa, and find support for range expansion having started from Northern Mali. Overall, we report that the empirical distribution of singletons is a useful measure to analyze results of sequencing projects based on large scale sampling of individuals across geographic space. Frontiers Media S.A. 2017-09-29 /pmc/articles/PMC5627571/ /pubmed/29033977 http://dx.doi.org/10.3389/fgene.2017.00139 Text en Copyright © 2017 Cubry, Vigouroux and François. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Cubry, Philippe
Vigouroux, Yves
François, Olivier
The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title_full The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title_fullStr The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title_full_unstemmed The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title_short The Empirical Distribution of Singletons for Geographic Samples of DNA Sequences
title_sort empirical distribution of singletons for geographic samples of dna sequences
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627571/
https://www.ncbi.nlm.nih.gov/pubmed/29033977
http://dx.doi.org/10.3389/fgene.2017.00139
work_keys_str_mv AT cubryphilippe theempiricaldistributionofsingletonsforgeographicsamplesofdnasequences
AT vigourouxyves theempiricaldistributionofsingletonsforgeographicsamplesofdnasequences
AT francoisolivier theempiricaldistributionofsingletonsforgeographicsamplesofdnasequences
AT cubryphilippe empiricaldistributionofsingletonsforgeographicsamplesofdnasequences
AT vigourouxyves empiricaldistributionofsingletonsforgeographicsamplesofdnasequences
AT francoisolivier empiricaldistributionofsingletonsforgeographicsamplesofdnasequences