Cargando…
Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinica...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10670969/ https://www.ncbi.nlm.nih.gov/pubmed/38002987 http://dx.doi.org/10.3390/genes14112044 |
_version_ | 1785140044966658048 |
---|---|
author | Wang, Bing Finazzo, Mark Artsimovitch, Irina |
author_facet | Wang, Bing Finazzo, Mark Artsimovitch, Irina |
author_sort | Wang, Bing |
collection | PubMed |
description | Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinical antibiotics useless. To solve this urgent and growing problem, we must understand how plasmids spread across bacterial communities. Here, we applied machine-learning models to identify features that are important for extending the plasmid host range. We assembled an up-to-date dataset of more than thirty thousand bacterial plasmids, separated them into 1125 clusters, and assigned each cluster a distribution possibility score, taking into account the host distribution of each taxonomic rank and the sampling bias of the existing sequencing data. Using this score and an optimized plasmid feature pool, we built a model stack consisting of DecisionTreeRegressor, EvoTreeRegressor, and LGBMRegressor as base models and LinearRegressor as a meta-learner. Our mathematical modeling revealed that sequence brevity is the most important determinant for plasmid spread, followed by P-loop NTPases, mobility factors, and β-lactamases. Ours and other recent results suggest that small plasmids may broaden their range by evading host defenses and using alternative modes of transfer instead of autonomous conjugation. |
format | Online Article Text |
id | pubmed-10670969 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106709692023-11-05 Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range Wang, Bing Finazzo, Mark Artsimovitch, Irina Genes (Basel) Article Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinical antibiotics useless. To solve this urgent and growing problem, we must understand how plasmids spread across bacterial communities. Here, we applied machine-learning models to identify features that are important for extending the plasmid host range. We assembled an up-to-date dataset of more than thirty thousand bacterial plasmids, separated them into 1125 clusters, and assigned each cluster a distribution possibility score, taking into account the host distribution of each taxonomic rank and the sampling bias of the existing sequencing data. Using this score and an optimized plasmid feature pool, we built a model stack consisting of DecisionTreeRegressor, EvoTreeRegressor, and LGBMRegressor as base models and LinearRegressor as a meta-learner. Our mathematical modeling revealed that sequence brevity is the most important determinant for plasmid spread, followed by P-loop NTPases, mobility factors, and β-lactamases. Ours and other recent results suggest that small plasmids may broaden their range by evading host defenses and using alternative modes of transfer instead of autonomous conjugation. MDPI 2023-11-05 /pmc/articles/PMC10670969/ /pubmed/38002987 http://dx.doi.org/10.3390/genes14112044 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Bing Finazzo, Mark Artsimovitch, Irina Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title | Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title_full | Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title_fullStr | Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title_full_unstemmed | Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title_short | Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range |
title_sort | machine learning suggests that small size helps broaden plasmid host range |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10670969/ https://www.ncbi.nlm.nih.gov/pubmed/38002987 http://dx.doi.org/10.3390/genes14112044 |
work_keys_str_mv | AT wangbing machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange AT finazzomark machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange AT artsimovitchirina machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange |