Cargando…
Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance
Bacterial plasmids play a large role in allowing bacteria to adapt to changing environments and can pose a significant risk to human health if they confer virulence and antimicrobial resistance (AMR). Plasmids differ significantly in the taxonomic breadth of host bacteria in which they can successfu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Microbiology Society
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660255/ https://www.ncbi.nlm.nih.gov/pubmed/32969786 http://dx.doi.org/10.1099/mgen.0.000435 |
_version_ | 1783608972902662144 |
---|---|
author | Robertson, James Bessonov, Kyrylo Schonfeld, Justin Nash, John H. E. |
author_facet | Robertson, James Bessonov, Kyrylo Schonfeld, Justin Nash, John H. E. |
author_sort | Robertson, James |
collection | PubMed |
description | Bacterial plasmids play a large role in allowing bacteria to adapt to changing environments and can pose a significant risk to human health if they confer virulence and antimicrobial resistance (AMR). Plasmids differ significantly in the taxonomic breadth of host bacteria in which they can successfully replicate, this is commonly referred to as ‘host range’ and is usually described in qualitative terms of ‘narrow’ or ‘broad’. Understanding the host range potential of plasmids is of great interest due to their ability to disseminate traits such as AMR through bacterial populations and into human pathogens. We developed the MOB-suite to facilitate characterization of plasmids and introduced a whole-sequence-based classification system based on clustering complete plasmid sequences using Mash distances (https://github.com/phac-nml/mob-suite). We updated the MOB-suite database from 12 091 to 23 671 complete sequences, representing 17 779 unique plasmids. With advances in new algorithms for rapidly calculating average nucleotide identity (ANI), we compared clustering characteristics using two different distance measures – Mash and ANI – and three clustering algorithms on the unique set of plasmids. The plasmid nomenclature is designed to group highly similar plasmids together that are unlikely to have multiple representatives within a single cell. Based on our results, we determined that clusters generated using Mash and complete-linkage clustering at a Mash distance of 0.06 resulted in highly homogeneous clusters while maintaining cluster size. The taxonomic distribution of plasmid biomarker sequences for replication and relaxase typing, in combination with MOB-suite whole-sequence-based clusters have been examined in detail for all high-quality publicly available plasmid sequences. We have incorporated prediction of plasmid replication host range into the MOB-suite based on observed distributions of these sequence features in combination with known plasmid hosts from the literature. Host range is reported as the highest taxonomic rank that covers all of the plasmids which share replicon or relaxase biomarkers or belong to the same MOB-suite cluster code. Reporting host range based on these criteria allows for comparisons of host range between studies and provides information for plasmid surveillance. |
format | Online Article Text |
id | pubmed-7660255 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Microbiology Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-76602552020-11-13 Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance Robertson, James Bessonov, Kyrylo Schonfeld, Justin Nash, John H. E. Microb Genom Research Article Bacterial plasmids play a large role in allowing bacteria to adapt to changing environments and can pose a significant risk to human health if they confer virulence and antimicrobial resistance (AMR). Plasmids differ significantly in the taxonomic breadth of host bacteria in which they can successfully replicate, this is commonly referred to as ‘host range’ and is usually described in qualitative terms of ‘narrow’ or ‘broad’. Understanding the host range potential of plasmids is of great interest due to their ability to disseminate traits such as AMR through bacterial populations and into human pathogens. We developed the MOB-suite to facilitate characterization of plasmids and introduced a whole-sequence-based classification system based on clustering complete plasmid sequences using Mash distances (https://github.com/phac-nml/mob-suite). We updated the MOB-suite database from 12 091 to 23 671 complete sequences, representing 17 779 unique plasmids. With advances in new algorithms for rapidly calculating average nucleotide identity (ANI), we compared clustering characteristics using two different distance measures – Mash and ANI – and three clustering algorithms on the unique set of plasmids. The plasmid nomenclature is designed to group highly similar plasmids together that are unlikely to have multiple representatives within a single cell. Based on our results, we determined that clusters generated using Mash and complete-linkage clustering at a Mash distance of 0.06 resulted in highly homogeneous clusters while maintaining cluster size. The taxonomic distribution of plasmid biomarker sequences for replication and relaxase typing, in combination with MOB-suite whole-sequence-based clusters have been examined in detail for all high-quality publicly available plasmid sequences. We have incorporated prediction of plasmid replication host range into the MOB-suite based on observed distributions of these sequence features in combination with known plasmid hosts from the literature. Host range is reported as the highest taxonomic rank that covers all of the plasmids which share replicon or relaxase biomarkers or belong to the same MOB-suite cluster code. Reporting host range based on these criteria allows for comparisons of host range between studies and provides information for plasmid surveillance. Microbiology Society 2020-09-24 /pmc/articles/PMC7660255/ /pubmed/32969786 http://dx.doi.org/10.1099/mgen.0.000435 Text en © 2020 Crown Copyright http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution NonCommercial License. |
spellingShingle | Research Article Robertson, James Bessonov, Kyrylo Schonfeld, Justin Nash, John H. E. Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title | Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title_full | Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title_fullStr | Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title_full_unstemmed | Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title_short | Universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
title_sort | universal whole-sequence-based plasmid typing and its utility to prediction of host range and epidemiological surveillance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660255/ https://www.ncbi.nlm.nih.gov/pubmed/32969786 http://dx.doi.org/10.1099/mgen.0.000435 |
work_keys_str_mv | AT robertsonjames universalwholesequencebasedplasmidtypinganditsutilitytopredictionofhostrangeandepidemiologicalsurveillance AT bessonovkyrylo universalwholesequencebasedplasmidtypinganditsutilitytopredictionofhostrangeandepidemiologicalsurveillance AT schonfeldjustin universalwholesequencebasedplasmidtypinganditsutilitytopredictionofhostrangeandepidemiologicalsurveillance AT nashjohnhe universalwholesequencebasedplasmidtypinganditsutilitytopredictionofhostrangeandepidemiologicalsurveillance |