Cargando…

PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest

Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved us...

Descripción completa

Detalles Bibliográficos
Autores principales: Aytan-Aktug, Derya, Clausen, Philip T. L. C., Szarvas, Judit, Munk, Patrick, Otani, Saria, Nguyen, Marcus, Davis, James J., Lund, Ole, Aarestrup, Frank M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040769/
https://www.ncbi.nlm.nih.gov/pubmed/35382558
http://dx.doi.org/10.1128/msystems.01180-21
_version_ 1784694405992546304
author Aytan-Aktug, Derya
Clausen, Philip T. L. C.
Szarvas, Judit
Munk, Patrick
Otani, Saria
Nguyen, Marcus
Davis, James J.
Lund, Ole
Aarestrup, Frank M.
author_facet Aytan-Aktug, Derya
Clausen, Philip T. L. C.
Szarvas, Judit
Munk, Patrick
Otani, Saria
Nguyen, Marcus
Davis, James J.
Lund, Ole
Aarestrup, Frank M.
author_sort Aytan-Aktug, Derya
collection PubMed
description Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved using automated pattern detection methods compared to homology-based methods due to the diversity and genetic plasticity of plasmids. In this study, we developed a method for predicting the host range of plasmids using machine learning—specifically, random forests. We trained the models with 8,519 plasmids from 359 different bacterial species per taxonomic level; the models achieved Matthews correlation coefficients of 0.662 and 0.867 at the species and order levels, respectively. Our results suggest that despite the diverse nature and genetic plasticity of plasmids, our random forest model can accurately distinguish between plasmid hosts. This tool is available online through the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/PlasmidHostFinder/). IMPORTANCE Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence. In addition to lateral inheritance, plasmids can be transferred horizontally between bacterial taxa. Therefore, detection of the host range of plasmids is crucial for understanding and predicting the dissemination trajectories of extrachromosomal genes and bacterial evolution as well as taking effective countermeasures against antimicrobial resistance.
format Online
Article
Text
id pubmed-9040769
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-90407692022-04-27 PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest Aytan-Aktug, Derya Clausen, Philip T. L. C. Szarvas, Judit Munk, Patrick Otani, Saria Nguyen, Marcus Davis, James J. Lund, Ole Aarestrup, Frank M. mSystems Research Article Plasmids play a major role facilitating the spread of antimicrobial resistance between bacteria. Understanding the host range and dissemination trajectories of plasmids is critical for surveillance and prevention of antimicrobial resistance. Identification of plasmid host ranges could be improved using automated pattern detection methods compared to homology-based methods due to the diversity and genetic plasticity of plasmids. In this study, we developed a method for predicting the host range of plasmids using machine learning—specifically, random forests. We trained the models with 8,519 plasmids from 359 different bacterial species per taxonomic level; the models achieved Matthews correlation coefficients of 0.662 and 0.867 at the species and order levels, respectively. Our results suggest that despite the diverse nature and genetic plasticity of plasmids, our random forest model can accurately distinguish between plasmid hosts. This tool is available online through the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/PlasmidHostFinder/). IMPORTANCE Antimicrobial resistance is a global health threat to humans and animals, causing high mortality and morbidity while effectively ending decades of success in fighting against bacterial infections. Plasmids confer extra genetic capabilities to the host organisms through accessory genes that can encode antimicrobial resistance and virulence. In addition to lateral inheritance, plasmids can be transferred horizontally between bacterial taxa. Therefore, detection of the host range of plasmids is crucial for understanding and predicting the dissemination trajectories of extrachromosomal genes and bacterial evolution as well as taking effective countermeasures against antimicrobial resistance. American Society for Microbiology 2022-04-06 /pmc/articles/PMC9040769/ /pubmed/35382558 http://dx.doi.org/10.1128/msystems.01180-21 Text en Copyright © 2022 Aytan-Aktug et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research Article
Aytan-Aktug, Derya
Clausen, Philip T. L. C.
Szarvas, Judit
Munk, Patrick
Otani, Saria
Nguyen, Marcus
Davis, James J.
Lund, Ole
Aarestrup, Frank M.
PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title_full PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title_fullStr PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title_full_unstemmed PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title_short PlasmidHostFinder: Prediction of Plasmid Hosts Using Random Forest
title_sort plasmidhostfinder: prediction of plasmid hosts using random forest
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9040769/
https://www.ncbi.nlm.nih.gov/pubmed/35382558
http://dx.doi.org/10.1128/msystems.01180-21
work_keys_str_mv AT aytanaktugderya plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT clausenphiliptlc plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT szarvasjudit plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT munkpatrick plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT otanisaria plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT nguyenmarcus plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT davisjamesj plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT lundole plasmidhostfinderpredictionofplasmidhostsusingrandomforest
AT aarestrupfrankm plasmidhostfinderpredictionofplasmidhostsusingrandomforest