Cargando…

Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe

Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for...

Descripción completa

Detalles Bibliográficos
Autores principales: Tonkin-Hill, Gerry, Gladstone, Rebecca A., Pöntinen, Anna K., Arredondo-Alonso, Sergio, Bentley, Stephen D., Corander, Jukka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977150/
https://www.ncbi.nlm.nih.gov/pubmed/36669850
http://dx.doi.org/10.1101/gr.277340.122
_version_ 1784899232360038400
author Tonkin-Hill, Gerry
Gladstone, Rebecca A.
Pöntinen, Anna K.
Arredondo-Alonso, Sergio
Bentley, Stephen D.
Corander, Jukka
author_facet Tonkin-Hill, Gerry
Gladstone, Rebecca A.
Pöntinen, Anna K.
Arredondo-Alonso, Sergio
Bentley, Stephen D.
Corander, Jukka
author_sort Tonkin-Hill, Gerry
collection PubMed
description Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens.
format Online
Article
Text
id pubmed-9977150
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-99771502023-03-02 Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe Tonkin-Hill, Gerry Gladstone, Rebecca A. Pöntinen, Anna K. Arredondo-Alonso, Sergio Bentley, Stephen D. Corander, Jukka Genome Res Method Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens. Cold Spring Harbor Laboratory Press 2023-01 /pmc/articles/PMC9977150/ /pubmed/36669850 http://dx.doi.org/10.1101/gr.277340.122 Text en © 2023 Tonkin-Hill et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Method
Tonkin-Hill, Gerry
Gladstone, Rebecca A.
Pöntinen, Anna K.
Arredondo-Alonso, Sergio
Bentley, Stephen D.
Corander, Jukka
Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title_full Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title_fullStr Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title_full_unstemmed Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title_short Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
title_sort robust analysis of prokaryotic pangenome gene gain and loss rates with panstripe
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977150/
https://www.ncbi.nlm.nih.gov/pubmed/36669850
http://dx.doi.org/10.1101/gr.277340.122
work_keys_str_mv AT tonkinhillgerry robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe
AT gladstonerebeccaa robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe
AT pontinenannak robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe
AT arredondoalonsosergio robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe
AT bentleystephend robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe
AT coranderjukka robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe