Cargando…
Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe
Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977150/ https://www.ncbi.nlm.nih.gov/pubmed/36669850 http://dx.doi.org/10.1101/gr.277340.122 |
_version_ | 1784899232360038400 |
---|---|
author | Tonkin-Hill, Gerry Gladstone, Rebecca A. Pöntinen, Anna K. Arredondo-Alonso, Sergio Bentley, Stephen D. Corander, Jukka |
author_facet | Tonkin-Hill, Gerry Gladstone, Rebecca A. Pöntinen, Anna K. Arredondo-Alonso, Sergio Bentley, Stephen D. Corander, Jukka |
author_sort | Tonkin-Hill, Gerry |
collection | PubMed |
description | Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens. |
format | Online Article Text |
id | pubmed-9977150 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-99771502023-03-02 Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe Tonkin-Hill, Gerry Gladstone, Rebecca A. Pöntinen, Anna K. Arredondo-Alonso, Sergio Bentley, Stephen D. Corander, Jukka Genome Res Method Horizontal gene transfer (HGT) plays a critical role in the evolution and diversification of many microbial species. The resulting dynamics of gene gain and loss can have important implications for the development of antibiotic resistance and the design of vaccine and drug interventions. Methods for the analysis of gene presence/absence patterns typically do not account for errors introduced in the automated annotation and clustering of gene sequences. In particular, methods adapted from ecological studies, including the pangenome gene accumulation curve, can be misleading as they may reflect the underlying diversity in the temporal sampling of genomes rather than a difference in the dynamics of HGT. Here, we introduce Panstripe, a method based on generalized linear regression that is robust to population structure, sampling bias, and errors in the predicted presence/absence of genes. We show using simulations that Panstripe can effectively identify differences in the rate and number of genes involved in HGT events, and illustrate its capability by analyzing several diverse bacterial genome data sets representing major human pathogens. Cold Spring Harbor Laboratory Press 2023-01 /pmc/articles/PMC9977150/ /pubmed/36669850 http://dx.doi.org/10.1101/gr.277340.122 Text en © 2023 Tonkin-Hill et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Method Tonkin-Hill, Gerry Gladstone, Rebecca A. Pöntinen, Anna K. Arredondo-Alonso, Sergio Bentley, Stephen D. Corander, Jukka Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title | Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title_full | Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title_fullStr | Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title_full_unstemmed | Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title_short | Robust analysis of prokaryotic pangenome gene gain and loss rates with Panstripe |
title_sort | robust analysis of prokaryotic pangenome gene gain and loss rates with panstripe |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9977150/ https://www.ncbi.nlm.nih.gov/pubmed/36669850 http://dx.doi.org/10.1101/gr.277340.122 |
work_keys_str_mv | AT tonkinhillgerry robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe AT gladstonerebeccaa robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe AT pontinenannak robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe AT arredondoalonsosergio robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe AT bentleystephend robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe AT coranderjukka robustanalysisofprokaryoticpangenomegenegainandlossrateswithpanstripe |