Cargando…

Parallel or convergent evolution in human population genomic data revealed by genotype networks

BACKGROUND: Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g.,...

Descripción completa

Detalles Bibliográficos
Autores principales: R. Vahdati, Ali, Wagner, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4969671/
https://www.ncbi.nlm.nih.gov/pubmed/27484992
http://dx.doi.org/10.1186/s12862-016-0722-0
_version_ 1782445820066398208
author R. Vahdati, Ali
Wagner, Andreas
author_facet R. Vahdati, Ali
Wagner, Andreas
author_sort R. Vahdati, Ali
collection PubMed
description BACKGROUND: Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. RESULTS: We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The structure of these networks varies widely among genes, indicating different patterns of variation despite a shared evolutionary history. We focus on those genes whose genotype networks show many cycles, which can indicate homoplasy, i.e., parallel or convergent evolution, on the sequence level. CONCLUSION: For 42 genes, the observed number of cycles is so large that it cannot be explained by either chance homoplasy or recombination. When analyzing possible explanations, we discovered evidence for positive selection in 21 of these genes and, in addition, a potential role for constrained variation and purifying selection. Balancing selection plays at most a small role. The 42 genes with excess cycles are enriched in functions related to immunity and response to pathogens. Genotype networks are representations of genetic variation data that can help understand unusual patterns of genomic variation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-016-0722-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4969671
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49696712016-08-03 Parallel or convergent evolution in human population genomic data revealed by genotype networks R. Vahdati, Ali Wagner, Andreas BMC Evol Biol Research Article BACKGROUND: Genotype networks are representations of genetic variation data that are complementary to phylogenetic trees. A genotype network is a graph whose nodes are genotypes (DNA sequences) with the same broadly defined phenotype. Two nodes are connected if they differ in some minimal way, e.g., in a single nucleotide. RESULTS: We analyze human genome variation data from the 1,000 genomes project, and construct haploid genotype (haplotype) networks for 12,235 protein coding genes. The structure of these networks varies widely among genes, indicating different patterns of variation despite a shared evolutionary history. We focus on those genes whose genotype networks show many cycles, which can indicate homoplasy, i.e., parallel or convergent evolution, on the sequence level. CONCLUSION: For 42 genes, the observed number of cycles is so large that it cannot be explained by either chance homoplasy or recombination. When analyzing possible explanations, we discovered evidence for positive selection in 21 of these genes and, in addition, a potential role for constrained variation and purifying selection. Balancing selection plays at most a small role. The 42 genes with excess cycles are enriched in functions related to immunity and response to pathogens. Genotype networks are representations of genetic variation data that can help understand unusual patterns of genomic variation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12862-016-0722-0) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-02 /pmc/articles/PMC4969671/ /pubmed/27484992 http://dx.doi.org/10.1186/s12862-016-0722-0 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
R. Vahdati, Ali
Wagner, Andreas
Parallel or convergent evolution in human population genomic data revealed by genotype networks
title Parallel or convergent evolution in human population genomic data revealed by genotype networks
title_full Parallel or convergent evolution in human population genomic data revealed by genotype networks
title_fullStr Parallel or convergent evolution in human population genomic data revealed by genotype networks
title_full_unstemmed Parallel or convergent evolution in human population genomic data revealed by genotype networks
title_short Parallel or convergent evolution in human population genomic data revealed by genotype networks
title_sort parallel or convergent evolution in human population genomic data revealed by genotype networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4969671/
https://www.ncbi.nlm.nih.gov/pubmed/27484992
http://dx.doi.org/10.1186/s12862-016-0722-0
work_keys_str_mv AT rvahdatiali parallelorconvergentevolutioninhumanpopulationgenomicdatarevealedbygenotypenetworks
AT wagnerandreas parallelorconvergentevolutioninhumanpopulationgenomicdatarevealedbygenotypenetworks