Cargando…
PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved us...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040769/ https://www.ncbi.nlm.nih.gov/pubmed/35382558 http://dx.doi.org/10.1128/msystems.01180-21 |
_version_ | 1784694405992546304 |
---|---|
author | Aytan-Aktug, Derya Clausen, Philip T. L. C. Szarvas, Judit Munk, Patrick Otani, Saria Nguyen, Marcus Davis, James J. Lund, Ole Aarestrup, Frank M. |
author_facet | Aytan-Aktug, Derya Clausen, Philip T. L. C. Szarvas, Judit Munk, Patrick Otani, Saria Nguyen, Marcus Davis, James J. Lund, Ole Aarestrup, Frank M. |
author_sort | Aytan-Aktug, Derya |
collection | PubMed |
description | Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved using automated pattern detection methods compared to homology-based methods due to the diversity and genetic plasticity of plasmids. In this study, we developed a method for predicting the host range of plasmids using machine learning—specifically, random forests. We trained the models with 8,519 plasmids from 359 different bacterial species per taxonomic level; the models achieved Matthews correlation coefficients of 0.662 and 0.867 at the species and order levels, respectively. Our results suggest that despite the diverse nature and genetic plasticity of plasmids, our random forest model can accurately distinguish between plasmid hosts. This tool is available online through the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/PlasmidHostFinder/). IMPORTANCE Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence. In addition to lateral inheritance, plasmids can be transferred horizontally between bacterial taxa. Therefore, detection of the host range of plasmids is crucial for understanding and predicting the dissemination trajectories of extrachromosomal genes and bacterial evolution as well as taking effective countermeasures against antimicrobial resistance. |
format | Online Article Text |
id | pubmed-9040769 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-90407692022-04-27 PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest Aytan-Aktug, Derya Clausen, Philip T. L. C. Szarvas, Judit Munk, Patrick Otani, Saria Nguyen, Marcus Davis, James J. Lund, Ole Aarestrup, Frank M. mSystems Research Article Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved using automated pattern detection methods compared to homology-based methods due to the diversity and genetic plasticity of plasmids. In this study, we developed a method for predicting the host range of plasmids using machine learning—specifically, random forests. We trained the models with 8,519 plasmids from 359 different bacterial species per taxonomic level; the models achieved Matthews correlation coefficients of 0.662 and 0.867 at the species and order levels, respectively. Our results suggest that despite the diverse nature and genetic plasticity of plasmids, our random forest model can accurately distinguish between plasmid hosts. This tool is available online through the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/PlasmidHostFinder/). IMPORTANCE Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence. In addition to lateral inheritance, plasmids can be transferred horizontally between bacterial taxa. Therefore, detection of the host range of plasmids is crucial for understanding and predicting the dissemination trajectories of extrachromosomal genes and bacterial evolution as well as taking effective countermeasures against antimicrobial resistance. American Society for Microbiology 2022-04-06 /pmc/articles/PMC9040769/ /pubmed/35382558 http://dx.doi.org/10.1128/msystems.01180-21 Text en Copyright © 2022 Aytan-Aktug et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Article Aytan-Aktug, Derya Clausen, Philip T. L. C. Szarvas, Judit Munk, Patrick Otani, Saria Nguyen, Marcus Davis, James J. Lund, Ole Aarestrup, Frank M. PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title | PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title_full | PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title_fullStr | PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title_full_unstemmed | PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title_short | PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest |
title_sort | plasmidhostfinder: prediction of plasmid hosts using random forest |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040769/ https://www.ncbi.nlm.nih.gov/pubmed/35382558 http://dx.doi.org/10.1128/msystems.01180-21 |
work_keys_str_mv | AT aytanaktugderya plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT clausenphiliptlc plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT szarvasjudit plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT munkpatrick plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT otanisaria plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT nguyenmarcus plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT davisjamesj plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT lundole plasmidhostfinderpredictionofplasmidhostsusingrandomforest AT aarestrupfrankm plasmidhostfinderpredictionofplasmidhostsusingrandomforest |