Cargando…

Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes

The pangenome is the set of all genes present in a prokaryotic population. Most pangenomes contain many accessory genes of low and intermediate frequencies. Different population genetics processes contribute to the shape of these pangenomes, namely selection and fitness-independent processes such as...

Descripción completa

Detalles Bibliográficos
Autores principales: Baumdicker, Franz, Kupczok, Anne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182730/
https://www.ncbi.nlm.nih.gov/pubmed/37093956
http://dx.doi.org/10.1093/gbe/evad067
_version_ 1785041814365929472
author Baumdicker, Franz
Kupczok, Anne
author_facet Baumdicker, Franz
Kupczok, Anne
author_sort Baumdicker, Franz
collection PubMed
description The pangenome is the set of all genes present in a prokaryotic population. Most pangenomes contain many accessory genes of low and intermediate frequencies. Different population genetics processes contribute to the shape of these pangenomes, namely selection and fitness-independent processes such as gene transfer, gene loss, and migration. However, their relative importance is unknown and highly debated. Here, we argue that the debate around prokaryotic pangenomes arose due to the imprecise application of population genetics models. Most importantly, two different processes of horizontal gene transfer act on prokaryotic populations, which are frequently confused, despite their fundamentally different behavior. Genes acquired from distantly related organisms (termed here acquiring gene transfer) are most comparable to mutation in nucleotide sequences. In contrast, gene gain within the population (termed here spreading gene transfer) has an effect on gene frequencies that is identical to the effect of positive selection on single genes. We thus show that selection and fitness-independent population genetic processes affecting pangenomes are indistinguishable at the level of single gene dynamics. Nevertheless, population genetics processes are fundamentally different when considering the joint distribution of all accessory genes across individuals of a population. We propose that, to understand to which degree the different processes shaped pangenome diversity, the development of comprehensive models and simulation tools is mandatory. Furthermore, we need to identify summary statistics and measurable features that can distinguish between the processes, where considering the joint distribution of accessory genes across individuals of a population will be particularly relevant.
format Online
Article
Text
id pubmed-10182730
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101827302023-05-14 Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes Baumdicker, Franz Kupczok, Anne Genome Biol Evol Perspective The pangenome is the set of all genes present in a prokaryotic population. Most pangenomes contain many accessory genes of low and intermediate frequencies. Different population genetics processes contribute to the shape of these pangenomes, namely selection and fitness-independent processes such as gene transfer, gene loss, and migration. However, their relative importance is unknown and highly debated. Here, we argue that the debate around prokaryotic pangenomes arose due to the imprecise application of population genetics models. Most importantly, two different processes of horizontal gene transfer act on prokaryotic populations, which are frequently confused, despite their fundamentally different behavior. Genes acquired from distantly related organisms (termed here acquiring gene transfer) are most comparable to mutation in nucleotide sequences. In contrast, gene gain within the population (termed here spreading gene transfer) has an effect on gene frequencies that is identical to the effect of positive selection on single genes. We thus show that selection and fitness-independent population genetic processes affecting pangenomes are indistinguishable at the level of single gene dynamics. Nevertheless, population genetics processes are fundamentally different when considering the joint distribution of all accessory genes across individuals of a population. We propose that, to understand to which degree the different processes shaped pangenome diversity, the development of comprehensive models and simulation tools is mandatory. Furthermore, we need to identify summary statistics and measurable features that can distinguish between the processes, where considering the joint distribution of accessory genes across individuals of a population will be particularly relevant. Oxford University Press 2023-04-24 /pmc/articles/PMC10182730/ /pubmed/37093956 http://dx.doi.org/10.1093/gbe/evad067 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Perspective
Baumdicker, Franz
Kupczok, Anne
Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title_full Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title_fullStr Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title_full_unstemmed Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title_short Tackling the Pangenome Dilemma Requires the Concerted Analysis of Multiple Population Genetic Processes
title_sort tackling the pangenome dilemma requires the concerted analysis of multiple population genetic processes
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10182730/
https://www.ncbi.nlm.nih.gov/pubmed/37093956
http://dx.doi.org/10.1093/gbe/evad067
work_keys_str_mv AT baumdickerfranz tacklingthepangenomedilemmarequirestheconcertedanalysisofmultiplepopulationgeneticprocesses
AT kupczokanne tacklingthepangenomedilemmarequirestheconcertedanalysisofmultiplepopulationgeneticprocesses