Cargando…
Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes
The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495745/ https://www.ncbi.nlm.nih.gov/pubmed/33768248 http://dx.doi.org/10.1093/g3journal/jkab083 |
_version_ | 1784579612116779008 |
---|---|
author | Tvedte, Eric S Gasser, Mark Sparklin, Benjamin C Michalski, Jane Hjelmen, Carl E Johnston, J Spencer Zhao, Xuechu Bromley, Robin Tallon, Luke J Sadzewicz, Lisa Rasko, David A Dunning Hotopp, Julie C |
author_facet | Tvedte, Eric S Gasser, Mark Sparklin, Benjamin C Michalski, Jane Hjelmen, Carl E Johnston, J Spencer Zhao, Xuechu Bromley, Robin Tallon, Luke J Sadzewicz, Lisa Rasko, David A Dunning Hotopp, Julie C |
author_sort | Tvedte, Eric S |
collection | PubMed |
description | The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined. |
format | Online Article Text |
id | pubmed-8495745 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-84957452021-10-07 Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes Tvedte, Eric S Gasser, Mark Sparklin, Benjamin C Michalski, Jane Hjelmen, Carl E Johnston, J Spencer Zhao, Xuechu Bromley, Robin Tallon, Luke J Sadzewicz, Lisa Rasko, David A Dunning Hotopp, Julie C G3 (Bethesda) Investigation The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined. Oxford University Press 2021-04-19 /pmc/articles/PMC8495745/ /pubmed/33768248 http://dx.doi.org/10.1093/g3journal/jkab083 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigation Tvedte, Eric S Gasser, Mark Sparklin, Benjamin C Michalski, Jane Hjelmen, Carl E Johnston, J Spencer Zhao, Xuechu Bromley, Robin Tallon, Luke J Sadzewicz, Lisa Rasko, David A Dunning Hotopp, Julie C Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title | Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title_full | Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title_fullStr | Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title_full_unstemmed | Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title_short | Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
title_sort | comparison of long-read sequencing technologies in interrogating bacteria and fly genomes |
topic | Investigation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495745/ https://www.ncbi.nlm.nih.gov/pubmed/33768248 http://dx.doi.org/10.1093/g3journal/jkab083 |
work_keys_str_mv | AT tvedteerics comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT gassermark comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT sparklinbenjaminc comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT michalskijane comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT hjelmencarle comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT johnstonjspencer comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT zhaoxuechu comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT bromleyrobin comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT tallonlukej comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT sadzewiczlisa comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT raskodavida comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes AT dunninghotoppjuliec comparisonoflongreadsequencingtechnologiesininterrogatingbacteriaandflygenomes |