Cargando…

Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data

The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information...

Descripción completa

Detalles Bibliográficos
Autores principales: Paganini, Julian A., Plantinga, Nienke L., Arredondo-Alonso, Sergio, Willems, Rob J. L., Schürch, Anita C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8400445/
https://www.ncbi.nlm.nih.gov/pubmed/34442692
http://dx.doi.org/10.3390/microorganisms9081613
_version_ 1783745316567121920
author Paganini, Julian A.
Plantinga, Nienke L.
Arredondo-Alonso, Sergio
Willems, Rob J. L.
Schürch, Anita C.
author_facet Paganini, Julian A.
Plantinga, Nienke L.
Arredondo-Alonso, Sergio
Willems, Rob J. L.
Schürch, Anita C.
author_sort Paganini, Julian A.
collection PubMed
description The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information, but recently a number of tools that allow plasmid prediction from short-read data have been developed. Here, we reviewed 25 available plasmid prediction tools and categorized them into binary plasmid/chromosome classification tools and plasmid reconstruction tools. We benchmarked six tools (MOB-suite, plasmidSPAdes, gplas, FishingForPlasmids, HyAsP and SCAPP) that aim to reliably reconstruct distinct plasmids, with a special focus on plasmids carrying antibiotic resistance genes (ARGs) such as extended-spectrum beta-lactamase genes. We found that two thirds (n = 425, 66.3%) of all plasmids were correctly reconstructed by at least one of the six tools, with a range of 92 (14.58%) to 317 (50.23%) correctly predicted plasmids. However, the majority of plasmids that carried antibiotic resistance genes (n = 85, 57.8%) could not be completely recovered as distinct plasmids by any of the tools. MOB-suite was the only tool that was able to correctly reconstruct the majority of plasmids (n = 317, 50.23%), and performed best at reconstructing large plasmids (n = 166, 46.37%) and ARG-plasmids (n = 41, 27.9%), but predictions frequently contained chromosome contamination (40%). In contrast, plasmidSPAdes reconstructed the highest fraction of plasmids smaller than 18 kbp (n = 168, 61.54%). Large ARG-plasmids, however, were frequently merged with sequences derived from distinct replicons. Available bioinformatic tools can provide valuable insight into E. coli plasmids, but also have important limitations. This work will serve as a guideline for selecting the most appropriate plasmid reconstruction tool for studies focusing on E. coli plasmids in the absence of long-read sequencing data.
format Online
Article
Text
id pubmed-8400445
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84004452021-08-29 Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data Paganini, Julian A. Plantinga, Nienke L. Arredondo-Alonso, Sergio Willems, Rob J. L. Schürch, Anita C. Microorganisms Article The incidence of infections caused by multidrug-resistant E. coli strains has risen in the past years. Antibiotic resistance in E. coli is often mediated by acquisition and maintenance of plasmids. The study of E. coli plasmid epidemiology and genomics often requires long-read sequencing information, but recently a number of tools that allow plasmid prediction from short-read data have been developed. Here, we reviewed 25 available plasmid prediction tools and categorized them into binary plasmid/chromosome classification tools and plasmid reconstruction tools. We benchmarked six tools (MOB-suite, plasmidSPAdes, gplas, FishingForPlasmids, HyAsP and SCAPP) that aim to reliably reconstruct distinct plasmids, with a special focus on plasmids carrying antibiotic resistance genes (ARGs) such as extended-spectrum beta-lactamase genes. We found that two thirds (n = 425, 66.3%) of all plasmids were correctly reconstructed by at least one of the six tools, with a range of 92 (14.58%) to 317 (50.23%) correctly predicted plasmids. However, the majority of plasmids that carried antibiotic resistance genes (n = 85, 57.8%) could not be completely recovered as distinct plasmids by any of the tools. MOB-suite was the only tool that was able to correctly reconstruct the majority of plasmids (n = 317, 50.23%), and performed best at reconstructing large plasmids (n = 166, 46.37%) and ARG-plasmids (n = 41, 27.9%), but predictions frequently contained chromosome contamination (40%). In contrast, plasmidSPAdes reconstructed the highest fraction of plasmids smaller than 18 kbp (n = 168, 61.54%). Large ARG-plasmids, however, were frequently merged with sequences derived from distinct replicons. Available bioinformatic tools can provide valuable insight into E. coli plasmids, but also have important limitations. This work will serve as a guideline for selecting the most appropriate plasmid reconstruction tool for studies focusing on E. coli plasmids in the absence of long-read sequencing data. MDPI 2021-07-28 /pmc/articles/PMC8400445/ /pubmed/34442692 http://dx.doi.org/10.3390/microorganisms9081613 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Paganini, Julian A.
Plantinga, Nienke L.
Arredondo-Alonso, Sergio
Willems, Rob J. L.
Schürch, Anita C.
Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title_full Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title_fullStr Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title_full_unstemmed Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title_short Recovering Escherichia coli Plasmids in the Absence of Long-Read Sequencing Data
title_sort recovering escherichia coli plasmids in the absence of long-read sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8400445/
https://www.ncbi.nlm.nih.gov/pubmed/34442692
http://dx.doi.org/10.3390/microorganisms9081613
work_keys_str_mv AT paganinijuliana recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata
AT plantinganienkel recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata
AT arredondoalonsosergio recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata
AT willemsrobjl recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata
AT schurchanitac recoveringescherichiacoliplasmidsintheabsenceoflongreadsequencingdata