Cargando…

Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range

Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinica...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Bing, Finazzo, Mark, Artsimovitch, Irina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10670969/
https://www.ncbi.nlm.nih.gov/pubmed/38002987
http://dx.doi.org/10.3390/genes14112044
_version_ 1785140044966658048
author Wang, Bing
Finazzo, Mark
Artsimovitch, Irina
author_facet Wang, Bing
Finazzo, Mark
Artsimovitch, Irina
author_sort Wang, Bing
collection PubMed
description Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinical antibiotics useless. To solve this urgent and growing problem, we must understand how plasmids spread across bacterial communities. Here, we applied machine-learning models to identify features that are important for extending the plasmid host range. We assembled an up-to-date dataset of more than thirty thousand bacterial plasmids, separated them into 1125 clusters, and assigned each cluster a distribution possibility score, taking into account the host distribution of each taxonomic rank and the sampling bias of the existing sequencing data. Using this score and an optimized plasmid feature pool, we built a model stack consisting of DecisionTreeRegressor, EvoTreeRegressor, and LGBMRegressor as base models and LinearRegressor as a meta-learner. Our mathematical modeling revealed that sequence brevity is the most important determinant for plasmid spread, followed by P-loop NTPases, mobility factors, and β-lactamases. Ours and other recent results suggest that small plasmids may broaden their range by evading host defenses and using alternative modes of transfer instead of autonomous conjugation.
format Online
Article
Text
id pubmed-10670969
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106709692023-11-05 Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range Wang, Bing Finazzo, Mark Artsimovitch, Irina Genes (Basel) Article Plasmids mediate gene exchange across taxonomic barriers through conjugation, shaping bacterial evolution for billions of years. While plasmid mobility can be harnessed for genetic engineering and drug-delivery applications, rapid plasmid-mediated spread of resistance genes has rendered most clinical antibiotics useless. To solve this urgent and growing problem, we must understand how plasmids spread across bacterial communities. Here, we applied machine-learning models to identify features that are important for extending the plasmid host range. We assembled an up-to-date dataset of more than thirty thousand bacterial plasmids, separated them into 1125 clusters, and assigned each cluster a distribution possibility score, taking into account the host distribution of each taxonomic rank and the sampling bias of the existing sequencing data. Using this score and an optimized plasmid feature pool, we built a model stack consisting of DecisionTreeRegressor, EvoTreeRegressor, and LGBMRegressor as base models and LinearRegressor as a meta-learner. Our mathematical modeling revealed that sequence brevity is the most important determinant for plasmid spread, followed by P-loop NTPases, mobility factors, and β-lactamases. Ours and other recent results suggest that small plasmids may broaden their range by evading host defenses and using alternative modes of transfer instead of autonomous conjugation. MDPI 2023-11-05 /pmc/articles/PMC10670969/ /pubmed/38002987 http://dx.doi.org/10.3390/genes14112044 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Wang, Bing
Finazzo, Mark
Artsimovitch, Irina
Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title_full Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title_fullStr Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title_full_unstemmed Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title_short Machine Learning Suggests That Small Size Helps Broaden Plasmid Host Range
title_sort machine learning suggests that small size helps broaden plasmid host range
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10670969/
https://www.ncbi.nlm.nih.gov/pubmed/38002987
http://dx.doi.org/10.3390/genes14112044
work_keys_str_mv AT wangbing machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange
AT finazzomark machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange
AT artsimovitchirina machinelearningsuggeststhatsmallsizehelpsbroadenplasmidhostrange