Cargando…

Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes

The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving...

Descripción completa

Detalles Bibliográficos
Autores principales: Tvedte, Eric S, Gasser, Mark, Sparklin, Benjamin C, Michalski, Jane, Hjelmen, Carl E, Johnston, J Spencer, Zhao, Xuechu, Bromley, Robin, Tallon, Luke J, Sadzewicz, Lisa, Rasko, David A, Dunning Hotopp, Julie C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495745/
https://www.ncbi.nlm.nih.gov/pubmed/33768248
http://dx.doi.org/10.1093/g3journal/jkab083
_version_ 1784579612116779008
author Tvedte, Eric S
Gasser, Mark
Sparklin, Benjamin C
Michalski, Jane
Hjelmen, Carl E
Johnston, J Spencer
Zhao, Xuechu
Bromley, Robin
Tallon, Luke J
Sadzewicz, Lisa
Rasko, David A
Dunning Hotopp, Julie C
author_facet Tvedte, Eric S
Gasser, Mark
Sparklin, Benjamin C
Michalski, Jane
Hjelmen, Carl E
Johnston, J Spencer
Zhao, Xuechu
Bromley, Robin
Tallon, Luke J
Sadzewicz, Lisa
Rasko, David A
Dunning Hotopp, Julie C
author_sort Tvedte, Eric S
collection PubMed
description The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined.
format Online
Article
Text
id pubmed-8495745
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84957452021-10-07 Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes Tvedte, Eric S Gasser, Mark Sparklin, Benjamin C Michalski, Jane Hjelmen, Carl E Johnston, J Spencer Zhao, Xuechu Bromley, Robin Tallon, Luke J Sadzewicz, Lisa Rasko, David A Dunning Hotopp, Julie C G3 (Bethesda) Investigation The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined. Oxford University Press 2021-04-19 /pmc/articles/PMC8495745/ /pubmed/33768248 http://dx.doi.org/10.1093/g3journal/jkab083 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Tvedte, Eric S
Gasser, Mark
Sparklin, Benjamin C
Michalski, Jane
Hjelmen, Carl E
Johnston, J Spencer
Zhao, Xuechu
Bromley, Robin
Tallon, Luke J
Sadzewicz, Lisa
Rasko, David A
Dunning Hotopp, Julie C
Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title_full Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title_fullStr Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title_full_unstemmed Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title_short Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
title_sort comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495745/
https://www.ncbi.nlm.nih.gov/pubmed/33768248
http://dx.doi.org/10.1093/g3journal/jkab083
work_keys_str_mv AT tvedteerics comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT gassermark comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT sparklinbenjaminc comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT michalskijane comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT hjelmencarle comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT johnstonjspencer comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT zhaoxuechu comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT bromleyrobin comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT tallonlukej comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT sadzewiczlisa comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT raskodavida comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes
AT dunninghotoppjuliec comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes