Cargando…

A high-throughput multiplexing and selection strategy to complete bacterial genomes

BACKGROUND: Bacterial whole-genome sequencing based on short-read technologies often results in a draft assembly formed by contiguous sequences. The introduction of long-read sequencing technologies permits those contiguous sequences to be unambiguously bridged into complete genomes. However, the el...

Descripción completa

Detalles Bibliográficos
Autores principales: Arredondo-Alonso, Sergio, Pöntinen, Anna K, Cléon, François, Gladstone, Rebecca A, Schürch, Anita C, Johnsen, Pål J, Samuelsen, Ørjan, Corander, Jukka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8673558/
https://www.ncbi.nlm.nih.gov/pubmed/34891160
http://dx.doi.org/10.1093/gigascience/giab079
_version_ 1784615477425733632
author Arredondo-Alonso, Sergio
Pöntinen, Anna K
Cléon, François
Gladstone, Rebecca A
Schürch, Anita C
Johnsen, Pål J
Samuelsen, Ørjan
Corander, Jukka
author_facet Arredondo-Alonso, Sergio
Pöntinen, Anna K
Cléon, François
Gladstone, Rebecca A
Schürch, Anita C
Johnsen, Pål J
Samuelsen, Ørjan
Corander, Jukka
author_sort Arredondo-Alonso, Sergio
collection PubMed
description BACKGROUND: Bacterial whole-genome sequencing based on short-read technologies often results in a draft assembly formed by contiguous sequences. The introduction of long-read sequencing technologies permits those contiguous sequences to be unambiguously bridged into complete genomes. However, the elevated costs associated with long-read sequencing frequently limit the number of bacterial isolates that can be long-read sequenced. Here we evaluated the recently released 96 barcoding kit from Oxford Nanopore Technologies (ONT) to generate complete genomes on a high-throughput basis. In addition, we propose an isolate selection strategy that optimizes a representative selection of isolates for long-read sequencing considering as input large-scale bacterial collections. RESULTS: Despite an uneven distribution of long reads per barcode, near-complete chromosomal sequences (assembly contiguity = 0.89) were generated for 96 Escherichia coli isolates with associated short-read sequencing data. The assembly contiguity of the plasmid replicons was even higher (0.98), which indicated the suitability of the multiplexing strategy for studies focused on resolving plasmid sequences. We benchmarked hybrid and ONT-only assemblies and showed that the combination of ONT sequencing data with short-read sequencing data is still highly desirable (i) to perform an unbiased selection of isolates for long-read sequencing, (ii) to achieve an optimal genome accuracy and completeness, and (iii) to include small plasmids underrepresented in the ONT library. CONCLUSIONS: The proposed long-read isolate selection ensures the completion of bacterial genomes that span the genome diversity inherent in large collections of bacterial isolates. We show the potential of using this multiplexing approach to close bacterial genomes on a high-throughput basis.
format Online
Article
Text
id pubmed-8673558
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86735582021-12-16 A high-throughput multiplexing and selection strategy to complete bacterial genomes Arredondo-Alonso, Sergio Pöntinen, Anna K Cléon, François Gladstone, Rebecca A Schürch, Anita C Johnsen, Pål J Samuelsen, Ørjan Corander, Jukka Gigascience Research BACKGROUND: Bacterial whole-genome sequencing based on short-read technologies often results in a draft assembly formed by contiguous sequences. The introduction of long-read sequencing technologies permits those contiguous sequences to be unambiguously bridged into complete genomes. However, the elevated costs associated with long-read sequencing frequently limit the number of bacterial isolates that can be long-read sequenced. Here we evaluated the recently released 96 barcoding kit from Oxford Nanopore Technologies (ONT) to generate complete genomes on a high-throughput basis. In addition, we propose an isolate selection strategy that optimizes a representative selection of isolates for long-read sequencing considering as input large-scale bacterial collections. RESULTS: Despite an uneven distribution of long reads per barcode, near-complete chromosomal sequences (assembly contiguity = 0.89) were generated for 96 Escherichia coli isolates with associated short-read sequencing data. The assembly contiguity of the plasmid replicons was even higher (0.98), which indicated the suitability of the multiplexing strategy for studies focused on resolving plasmid sequences. We benchmarked hybrid and ONT-only assemblies and showed that the combination of ONT sequencing data with short-read sequencing data is still highly desirable (i) to perform an unbiased selection of isolates for long-read sequencing, (ii) to achieve an optimal genome accuracy and completeness, and (iii) to include small plasmids underrepresented in the ONT library. CONCLUSIONS: The proposed long-read isolate selection ensures the completion of bacterial genomes that span the genome diversity inherent in large collections of bacterial isolates. We show the potential of using this multiplexing approach to close bacterial genomes on a high-throughput basis. Oxford University Press 2021-12-09 /pmc/articles/PMC8673558/ /pubmed/34891160 http://dx.doi.org/10.1093/gigascience/giab079 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Arredondo-Alonso, Sergio
Pöntinen, Anna K
Cléon, François
Gladstone, Rebecca A
Schürch, Anita C
Johnsen, Pål J
Samuelsen, Ørjan
Corander, Jukka
A high-throughput multiplexing and selection strategy to complete bacterial genomes
title A high-throughput multiplexing and selection strategy to complete bacterial genomes
title_full A high-throughput multiplexing and selection strategy to complete bacterial genomes
title_fullStr A high-throughput multiplexing and selection strategy to complete bacterial genomes
title_full_unstemmed A high-throughput multiplexing and selection strategy to complete bacterial genomes
title_short A high-throughput multiplexing and selection strategy to complete bacterial genomes
title_sort high-throughput multiplexing and selection strategy to complete bacterial genomes
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8673558/
https://www.ncbi.nlm.nih.gov/pubmed/34891160
http://dx.doi.org/10.1093/gigascience/giab079
work_keys_str_mv AT arredondoalonsosergio ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT pontinenannak ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT cleonfrancois ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT gladstonerebeccaa ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT schurchanitac ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT johnsenpalj ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT samuelsenørjan ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT coranderjukka ahighthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT arredondoalonsosergio highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT pontinenannak highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT cleonfrancois highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT gladstonerebeccaa highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT schurchanitac highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT johnsenpalj highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT samuelsenørjan highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes
AT coranderjukka highthroughputmultiplexingandselectionstrategytocompletebacterialgenomes