Cargando…
Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8400445/ https://www.ncbi.nlm.nih.gov/pubmed/34442692 http://dx.doi.org/10.3390/microorganisms9081613 |
_version_ | 1783745316567121920 |
---|---|
author | Paganini, Julian A. Plantinga, Nienke L. Arredondo-Alonso, Sergio Willems, Rob J. L. Schürch, Anita C. |
author_facet | Paganini, Julian A. Plantinga, Nienke L. Arredondo-Alonso, Sergio Willems, Rob J. L. Schürch, Anita C. |
author_sort | Paganini, Julian A. |
collection | PubMed |
description | The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information, but recently a number of tools that allow plasmid prediction from short-read data have been developed. Here, we reviewed 25 available plasmid prediction tools and categorized them into binary plasmid/chromosome classification tools and plasmid reconstruction tools. We benchmarked six tools (MOB-suite, plasmidSPAdes, gplas, FishingForPlasmids, HyAsP and SCAPP) that aim to reliably reconstruct distinct plasmids, with a special focus on plasmids carrying antibiotic resistance genes (ARGs) such as extended-spectrum beta-lactamase genes. We found that two thirds (n = 425, 66.3%) of all plasmids were correctly reconstructed by at least one of the six tools, with a range of 92 (14.58%) to 317 (50.23%) correctly predicted plasmids. However, the majority of plasmids that carried antibiotic resistance genes (n = 85, 57.8%) could not be completely recovered as distinct plasmids by any of the tools. MOB-suite was the only tool that was able to correctly reconstruct the majority of plasmids (n = 317, 50.23%), and performed best at reconstructing large plasmids (n = 166, 46.37%) and ARG-plasmids (n = 41, 27.9%), but predictions frequently contained chromosome contamination (40%). In contrast, plasmidSPAdes reconstructed the highest fraction of plasmids smaller than 18 kbp (n = 168, 61.54%). Large ARG-plasmids, however, were frequently merged with sequences derived from distinct replicons. Available bioinformatic tools can provide valuable insight into E. coli plasmids, but also have important limitations. This work will serve as a guideline for selecting the most appropriate plasmid reconstruction tool for studies focusing on E. coli plasmids in the absence of long-read sequencing data. |
format | Online Article Text |
id | pubmed-8400445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84004452021-08-29 Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data Paganini, Julian A. Plantinga, Nienke L. Arredondo-Alonso, Sergio Willems, Rob J. L. Schürch, Anita C. Microorganisms Article The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information, but recently a number of tools that allow plasmid prediction from short-read data have been developed. Here, we reviewed 25 available plasmid prediction tools and categorized them into binary plasmid/chromosome classification tools and plasmid reconstruction tools. We benchmarked six tools (MOB-suite, plasmidSPAdes, gplas, FishingForPlasmids, HyAsP and SCAPP) that aim to reliably reconstruct distinct plasmids, with a special focus on plasmids carrying antibiotic resistance genes (ARGs) such as extended-spectrum beta-lactamase genes. We found that two thirds (n = 425, 66.3%) of all plasmids were correctly reconstructed by at least one of the six tools, with a range of 92 (14.58%) to 317 (50.23%) correctly predicted plasmids. However, the majority of plasmids that carried antibiotic resistance genes (n = 85, 57.8%) could not be completely recovered as distinct plasmids by any of the tools. MOB-suite was the only tool that was able to correctly reconstruct the majority of plasmids (n = 317, 50.23%), and performed best at reconstructing large plasmids (n = 166, 46.37%) and ARG-plasmids (n = 41, 27.9%), but predictions frequently contained chromosome contamination (40%). In contrast, plasmidSPAdes reconstructed the highest fraction of plasmids smaller than 18 kbp (n = 168, 61.54%). Large ARG-plasmids, however, were frequently merged with sequences derived from distinct replicons. Available bioinformatic tools can provide valuable insight into E. coli plasmids, but also have important limitations. This work will serve as a guideline for selecting the most appropriate plasmid reconstruction tool for studies focusing on E. coli plasmids in the absence of long-read sequencing data. MDPI 2021-07-28 /pmc/articles/PMC8400445/ /pubmed/34442692 http://dx.doi.org/10.3390/microorganisms9081613 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Paganini, Julian A. Plantinga, Nienke L. Arredondo-Alonso, Sergio Willems, Rob J. L. Schürch, Anita C. Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title | Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title_full | Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title_fullStr | Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title_full_unstemmed | Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title_short | Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data |
title_sort | recovering escherichia coli plasmids in the absence of long-read sequencing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8400445/ https://www.ncbi.nlm.nih.gov/pubmed/34442692 http://dx.doi.org/10.3390/microorganisms9081613 |
work_keys_str_mv | AT paganinijuliana recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata AT plantinganienkel recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata AT arredondoalonsosergio recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata AT willemsrobjl recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata AT schurchanitac recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata |